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Abstract 

Relational  Semantics  can  be  used  to  give  a  denotation  to  the  non-disambiguated  logical 
forms  used  by  Natural  Language  Processing  systems,  representations  in  which  the  quanti¬ 
fiers  are  left  “in  situ”.  Giving  a  semantics  to  these  logical  forms  makes  it  unnecessary  for 
the  system  to  compute  all  the  disambiguated  interpretations  of  a  sentence  before  storing 
its  representation  in  the  knowledge  base.  Rules  of  inference  can  be  defined  so  that  the 
disambiguation  process  can  be  formally  modeled  in  a  declarative  way.  ‘Weaker’  rules  of  in¬ 
ference  can  also  be  specified  so  that  conclusions  can  be  derived  from  the  non-disambiguated 
representation. 


*This  technical  report  is  based  on  my  thesis  proposal.  Parts  of  this  work  will  appear  in  the  proceedings 
of  the  Second  Conference  on  Situation  Semantics  and  Its  Applications,  Loch  Rannoch,  Scotland,  1990. 

‘This  work  was  partially  supported  by  ONR/DARPA  research  contract  no.  N00014-82-K-0193,  and  in 
part  by  Air  Force;  Rome  Laboratory  research  contract  no.  P30602-91-C-0010 


□  □ 


SECURITY  CLASSIFICATION  OF  THIS  RAGE  r«hMi  Dmtm  En(*r*« 


REPORT  DOCUMENTATION  PAGE 


REPORT  NUMBER 


TR  389 


4.  Title  (and  SabtllU) 

Scope  Ambiguity  and  Inference 


READ  INSTRUCTIONS 

_ before  completing  form 


2.  GOVT  ACCESSION  NOJ  3.  RECIPIENT'S  CATALOG  NUMBER 


8.  TYPE  OF  REPORT  A  PERIOD  COVERED 

technical  report 


S.  PERFORMING  ORG.  REPORT  NUMBER 


7.  AUTHORfaJ 

Massimo  Poesio 


S.  contract  or  grant  HUMBERT*; 

N00014-82-K-0193 


9.  PERFORMING  ORGANIZATION  NAME  AND  ADDRESS 


to.  PROGRAM  ELEMENT.  PROJECT,  TASK 
AREA  A  WORK  UNIT  NUMBERS 


Computer  Science  Dept. 

University  of  Rochester,  734  Comp.  Studies  Bldg. 
Rochester,  NY,  14627,  USA 


11.  CONTROLLING  OFFICE  NAME  AND  AOORESS 

Defense  Advanced  Research  Projects  Agency 
1400  Wilson  Blvd. 

Arlington,  VA  22209 


14.  MONITORING  AGENCY  name  A  AODRESSTi/ dllfcrant  from  CofitrollinS  Offle*;  IS.  SECURITY  CLASS,  for  (h/a  r*por>; 

Office  of  Naval  Research  unclassified 

Information  Systems 
Arlington,  VA  22217 


12.  REPORT  DATE 

July  1991 


<3.  number  of  pages 

45 


ISa.  DECLASSIFICATION/ DOWN  GRADING 

schedule 


16.  DISTRIBUTION  STATEMENT  (ol  thi*  Kapori; 


Distribution  of  this  document  is  unlimited, 


t7.  OfSTRieuTlON  STATEMENT  (of  mUe^d  Ifi  Stock  20,  It  dIUormt  from  Roport) 


19.  KEY  WOROS  (Continue  on  roecrco  cldo  It  noeocemry  ond  Idonfl^  by  biock  number) 

natural  language  processing;  semantics;  scope  ambiguity;  relational 
semantics;  DRT;  logical  forms 


20.  ABSTRACT  (Continue  on  reeeree  elde  U  neeeeemfy  end  Identity  by  block  number) 

Relational  semantics  can  be  used  to  give  a  denotation  to  the  non-disambiguated 
logical  forms  used  by  natural  language  processing  systems,  representations  in 
which  the  quantifiers  are  left  "in  situ."  Giving  a  semantics  to  these  logical 
forms  makes  it  unnecessary  for  the  system  to  compute  all  the  disambiguated 
interpretations  of  a  sentence  before  storing  its  representation  in  the  knowledc 
base.  Rules  of  inference  can  be  defined  so  that  the  disambiguation  orocess  can 
be  formally  modeled  in  a  declarative  way.  'Weaker'  rules  of  inference  can  also 
be  specified  so  that  conclusions  can  be  derived  from  the  non-disambiguated 


DD 


EDITION  OF  I  NOV  6S  IS  OBSOLETE 


representation. 

SECURITY  CLASSIFICATION  OF  THIS  PAGE  flWian  Data  Enlaraif; 


Contents 


1  The  Problem  3 

2  Some  Solutions  6 

2.1  Disjunction .  6 

2.2  Vagueness .  6 

2.3  Dependency  Functions .  7 

2.4  Preferences  and  Backtracking .  8 

3  A  Crash  Course  on  DRT  11 

4  Relational  Semantics  14 

4.1  Meanings  as  Relations .  14 

4.2  A  Relational  Semantics  for  DRT .  15 

The  Syntax  of  DRTq .  15 

The  Semantics .  16 

The  Inference  Rules  .  17 

5  A  Relational  Semantics  for  Unscoped  Logical  Forms  18 

5.1  A  Non- disambiguated  Representation .  18 

5.2  Scope  Forests .  18 

Ordering  Constraints .  19 

Negation  and  Indefinites .  19 

Plural  Anaphora  to  Quantifiers .  21 

5.3  Reasoning  With  Scope  Forests  .  22 

Reasoning  without  Disambiguating .  22 

Inference  Rules  for  Scope  Disambiguation .  23 

5.4  Reasoning  with  Scope  Forests:  Examples  . .  25 

An  Elementary  Model  of  Reference  Disambiguation .  26 

Disambiguation  By  Deduction .  27 

6  A  Formal  Presentation  of  DRTx  30 

6.1  The  Syntax .  30 

6.2  The  Semantics  .  31 

6.3  The  Inference  Rules  .  34 


1 


7  Discussion  35 

7.1  Preferences  . .  35 

7.2  The  Role  of  Syntactic  Constraints  .  36 

7.3  How  Many  Interpretatitnns  are  Computed? .  37 

7.4  Other  Types  of  Ambiguity? .  37 

8  Conclusions  and  Future  Work  30 

8.1  Reasoning  Without  Disi^ambiguating? .  39 

8.2  Formalizing  Preferences.- .  39 

8.3  Tense  and  Scope  Ambiguities  . .  40 

8.4  Other  Issues .  40 


2 


1  The  Problem 


In  using  (1),  a  speaker  could  mean  that  there  is  some  one  undergraduate  who  is  dating 
all  male  students,  or  merely  that  all  male  students  date  some  undergraduate  or  other. 

(1)  Every  male  student  dates  an  undergrad. 

The  conventional  view  is  that  each  sentence  with  multiple  interpretations  is  to  be  seen  as 
ambiguous,  that  is,  each  interpretation  has  to  be  represented  by  a  distinct  formula.  The 
two  interpretations  of  (1)  are  represented  by  (2a)  and  (2b). 

/9x  a.  ((Vi)  (ms(i)  D  {{Sy)  v{y)  A  D(i,y)))) 
b.  ((3j/)  (u(y)  A  ((Vi)  (ms(i)  D  d(i,  y))))) 

“Traditional”^  Natural  Language  Processing  (NLP)  systems,  such  as  TEAM  [Grosz  et  al., 
1987]  or  the  Core  Language  Engine  [Alshawi  et  a/.,  1988],  built  according  to  this  view, 
analyze  (1)  more  or  less  as  follows:  First,  the  parser  computes  a  logical  form  [Webber,  1978; 
Schubert  and  Pelletier,  1982;  Allen,  1987;  Alshawi  and  van  Eijck,  1989]  which  is  similar  to 
the  S-structure  representation  of  (1)  before  Quantifier  Raising  [May,  1985]: 

(3)  [<every  x  male-student>  dates  <a  y  undergrad>] 

AU  the  unambiguous  interpretations  of  (1)  are  then  extracted  from  (3)  by  algorithms  like 
that  proposed  by  Hobbs  and  Shieber  [1987].  Finally,  the  system  must  choose  an  interpre¬ 
tation,  which  is  normally  done  using  preference  heuristics  [Hurum,  1988b]. 

The  disadvantages  of  this  method  have  not  gone  unnoticed  [Kempson  and  Cormack, 
1981;  Hobbs,  1983;  Allen,  1991;  Hirst,  1990].  In  this  thesis  I  will  concentrate  on  two  of  the 
problems  discussed  in  the  literature.  The  first  problem  is  that  a  system  like  the  one  just 
described  cannot  use  information  which  comes  later  in  the  discourse.  Yet,  that  information 
could  save  the  system  considerable  work.  Suppose,  for  example,  that  sentence  (1)  is  followed 
by  sentence  (4),  and  that  the  system  is  able  to  conclude  that  her  is  anaphoric  to  an  undergrad 
in  (1).  It  could  immediately  conclude  that  an  undergrad  in  (1)  has  wide  scope. 

(4)  I  met  her  yesterday. 

The  second  problem  is  that  the  number  of  interpretations  can  be  very  large,  and  therefore 
computing  them  all  can  be  very  expensive.  This  great  number  of  interpretations  is  caused 
by  at  least  two  factors:  First,  the  number  of  scopally  distinct  interpretations  grows  with  the 
factorial  of  the  number  of  NP’s,  with  the  result  that  sentence  (5)  has  5!  =  120  interpretations. 
Yet,  people  do  not  seem  to  entertain  120  possibilities  when  hearing  (5). 

(5)  In  most  democratic  countries  most  politicians  can  fool  most 

of  the  people  on  almost  every  issue  most  of  the  time.  [Hobbs,  1983] 

’  The  use  of  “traditional”  here  should  not  be  thought  of  as  derogatory.  Actually,  it  is  more  synonymous 
with  “working”. 
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Scope  ambiguities  can  combine  with  other  forms  of  ambiguity,  and  this  increases  the  number 
of  disambiguated  interpretations  even  further.  By  solely  considering  that  in  a  sentence  like 
(6a)  the  examiners  may  be  involved  to  a  different  degree  in  the  grading,  Kempson  and 
Cormack  [1981]  are  able  to  find  at  least  four  interpretations  for  it. 

...  a.  Two  examiners  marked  six  scripts. 

'  '  b.  Three  Frenchmen  visited  five  Russians. 

(6a)  can  be  used  to  mean  (i)  that  the  same  six  scripts  were  each  marked  by  two  examiners, 
(ii)  that  two  examiners  marked  six  (not  necessarily  the  same)  scripts  each,  (iii)  that  two 
examiners  marked  a  group  of  six  scripts  between  them,  and  (iv)  that  two  examiners  each 
marked  the  same  set  of  six  scripts.  As  for  (6b),  Partee  ([1975],  quoted  by  Bunt  [1985])  argues 
that  it  has  eight  readings;  Bunt,  also  counting  collective  and  distributive  interpretations, 
is  able  to  find  30  different  readings  for  it!  And  then  one  must  take  into  account  lexical 
ambiguity,  referential  ambiguity,  and  so  forth.  It  seems  unlikely  that  people  generate  all 
these  interpretations  when  processing  (5),  (6a)  or  (6b).  Even  30  interpretations  seems  too 
large  a  number  to  be  actually  considered.  And  yet,  sentences  of  this  type  are  more  common 
than  normally  believed,  as  shown  by  sentences  (7)-(9),  taken  from  a  set  of  10  computer 
news  articles: 

There  also  was,  however,  no  change  in  the  long-term  belief  by  many 
(7)  people  in  the  capital  -  some  Republicans  as  well  as  nearly  all 

Democrats  -  that  Quayle  was  simply  unqualified  to  become  president. 

.Qv  McGee  has  used  the  whistle-blowing  technique  numerous  times 
'  ^  over  the  past  several  years. 

The  yacht  was  often  used  for  social  and  political  events 
'  ^  by  several  presidents  until  Carter  disposed  of  it. 


and  people  don’t  seem  to  have  trouble  with  these  sentences. 

Relational  Semantics  is  a  semantic  system  related  to  Situation  Semantics  [Barwise 
and  Perry,  1983]  and  developed  to  explicitly  represent  the  different  ways  in  which  NP’s 
contribute  to  discourse  [Heim,  1982;  Barwise,  1987;  Rooth,  1987].  One  of  the  basic  ideas  of 
relational  semantics  is  that  the  denotation  of  a  sentence  is  not  a  truth  value,  but  a  relation, 
that  is,  a  set  of  pairs.  The  first  element  of  each  pair  is  a  variable  assignment  which  gives  to 
the  variables  values  which  satisfy  the  discourse  prior  to  the  sentence.  The  second  element 
is  a  variable  assignment  which  satisfies  the  discourse  after  the  sentence  has  been  added  to 
it  [Barwise,  1987;  Rooth,  1987]. 

If  we  use  relational  semantics  instead  of  the  classical  truth-valued  semantics,  we  can 
assign  a  denotation  to  the  “unscoped”  expressions  found  in  the  NLP  literature.  The  de¬ 
notation  of  such  an  expression  will  be  the  union  of  the  relations  denoted  by  each  of  the 
disambiguated  representations  derived  from  that  expression.  Why  is  it  useful  to  give  a 
denotation  to  these  expressions?  We  can  exploit  this  ability  in  three  ways:  First,  we  can 
define  weaker  versions  of  the  standard  inference  rules,  so  that  certain  types  of  conclusions 
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can  be  derived  without  disambiguating.  Second,  we  can  define  semantically  justified  infer¬ 
ence  rules  so  that  context  information  can  be  used  for  a  complete  or  partial  disambigua¬ 
tion.  Third,  we  may  conceive  a  declarative  way  of  dealing  with  the  phenomenon  of  scoping 
preferences,  the  fact  that  people  prefer  certain  interpretations  over  others  [Lakoff,  1971; 
loup,  1975;  Johnson- Laird,  1977;  Kempson  and  Cormack,  1981;  Fodor,  1982;  Hurum,  1988a; 
Moran,  1988;  Kurtzman  and  MacDonald,  1991].  If  desired  (and  if  the  logic  presented  in  this 
paper  is  extended  in  the  appropriate  way)  we  can  represent  these  preferences  as  plausible 
axioms  and  model  their  interaction  with  the  other  sources  of  disambiguating  information 
explicitly. 

A  NLP  system  using  the  representation  I  will  propose  needn’t  compute  all  the  interpre¬ 
tations  of  (1)  right  away.  Once  such  a  system  has  translated  (1)  into  an  unscoped  logical 
form,  it  can  immediately  store  the  logical  form  in  the  discourse  representation.  At  this 
point,  the  system  may  decide  whether  to  disambiguate  or  not  according  to  some  sort  of 
utility  measure.  If,  later  in  the  discourse,  a  sentence  like  (4)  is  also  asserted,  and  if  the 
system  is  able  to  conclude  that  her  in  the  second  sentence  is  anaphoric  to  an  undergrad  in 
the  first  sentence,  it  will  also  be  able  to  conclude  that  an  undergrad  takes  scope  over  every 
male  student.  It  should  be  clear  that,  in  this  way,  both  of  the  problems  with  the  previous 
kind  of  architecture  are  solved. 

I  will  first  discuss  in  section  2  three  proposals  for  dealing  with  scope  ambiguity  -  the 
idea  of  using  disjunction,  the  “radical  vagueness  proposal”  of  Kempson  and  Cormack,  and 
Hobbs’  solution  based  on  dependency  functions.  I  will  explain  why  these  solutions  are  all 
incomplete  in  one  way  or  the  other.  I  will  also  discuss  an  alternative  to  the  method  of 
generating  all  the  interpretations  -  the  idea  of  using  preference  principles  to  generate  a 
single  interpretation  and  then  backtrack  in  case  a  contradiction  is  found.  This  solution 
is  not  explicitly  presented  anywhere  in  the  literature,  but  is  implicitly  adopted  by  several 
systems:  I  will  argue  that  a  semantics  of  the  kind  presented  in  this  paper  is  a  necessary 
prerequisite  for  implementing  a  system  which  works  in  this  way. 

In  order  to  make  the  proposal  more  precise  I  will  need  a  discourse  representation  which 
makes  it  eaisy  to  talk  about  certain  discourse  inferences,  particularly  anaphora  resolution.  I 
wiU  therefore  use  a  style  of  representation  based  on  Discourse  Representation  Theory  (DRT) 
[Kamp,  1981],  with  the  hope  that  this  kind  of  representation  will  be  better  known  than,  say. 
Dynamic  Montague  Grammar  [Groenendijk  and  Stokhof,  1990]  or  Episodic  Logic  [Schubert 
and  Hwang,  1990].  I  will  review  DRT  very  briefly  in  section  3. 

I  will  introduce  relational  semantics  in  section  4,  and  show  how  one  can  replace  Kamp’s 
semantics  for  DRT  with  one  based  on  relational  semantics.  (After  this  redefinition,  the 
difference  between  the  form  of  representation  I  use  and  the  others  I  mentioned  will  reduce 
considerably.) 

Section  5  contains  the  core  idea  of  the  proposal;  I  will  first  show  in  section  5.1  that  one 
can  use  relational  semantics  to  add  a  construct  called  scope  forest  to  the  logic  presented  in 
section  4.2,  so  that  scopally  ambiguous  sentences  can  be  given  a  single  representation.  I 
will  also  introduce  a  new  set  of  inference  rules.  Before  introducing  the  logic  more  formally, 
I  will  give  examples  of  derivations  using  this  logic  in  section  5.4. 

In  section  7  I  will  discuss  the  implications  of  the  theory,  and  answer  some  possible 
objections.  I  will  also  consider  in  more  detail  the  issue  of  preferences. 
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2  Some  Solutions 


2.1  Disjunction 

The  easiest  way  to  represent  a  sentence  with  multiple  interpretations  without  loss  of 
information  is  to  represent  that  sentence  as  a  disjunction  of  its  interpretations.  Sentence 
(1),  for  example,  would  be  represented  by  the  disjunction  (10). 

nnt  (Ms(i)  D  ((3y)  u(y)  A  D(x,y))))  V 

^  ^  ((3y)  (u(y)  A  ((Vx)  (ms(x)  D  D(x,y))))) 

In  this  manner,  it  is  possible  to  take  advantage  of  disambiguation  information  in  the  context. 
This  solution  has  several  problems,  however.  One  of  these,  as  Kempson  and  Cormack  point 
out  [Kempson  and  Cormack,  1981],  is  the  Mapping  Problem:  there  are  a  number  of  reasons 
for  preferring  as  the  semantic  representation  of  a  sentence  a  logical  structure  as  close  as 
possible  to  its  syntactic  structure.  (10)  isn’t  such  a  representation.^  A  second,  and,  1 
think,  decisive,  argument  against  this  method  is  that  it  requires  all  the  interpretations  to 
be  computed,  and  therefore  does  not  solve  the  combinatorial  explosion  problem. 

2.2  Vagueness 

Kempson  and  Cormack  [1981]  contend  that  the  conventional  view  is  misled.  Even  if 

(I)  or  (6a)  have  different  interpretations  (as  K&C  put  it,  they  are  logically  ambiguous), 
they  claim  that  those  sentences  are  not  linguistically  ambiguous,  that  is,  they  have  a  single 
semantic  representation.  In  their  view,  the  representation  of  a  sentence  with  multiple  inter¬ 
pretations  is  the  weakest  representation  entailed  by  all  interpretations.  This  proposal  works 
fairly  well  for  sentences  such  as  (1),  because  the  two  interpretations  of  that  sentence  are 
not  in  fact  distinct:  the  reading  in  which  a  single  undergraduate  is  dating  all  male  students 
entails  the  other.  The  representation  initially  proposed  by  Kempson  and  Cormack  for  (1) 
is  (11). 

(II)  3M  3f/i  3u„€t/  D(m,u) 

In  order  to  extend  this  method  to  represent  sentences  like  (6a),  however,  something  more 
drastic  is  called  for,  since  none  of  the  interpretations  of  (6a)  is  entailed  by  each  of  vhe  other 
three;  the  120  interpretations  of  (5b)  are  also  all  distinct.  In  order  to  give  a  unique  semantic 
representation  to  sentences  like  these,  Kempson  and  Cormack  must  introduce  a  second 
version  of  the  theory,  in  which  a  much  weaker  representation  is  used.  The  representation 
of  (6a)  is  (12a),  and  the  representation  of  (1)  in  the  second  version  of  the  theory  is  shown 
in  (12b). 

a.  3A'2  356  ^XxeXs  m(x,s) 

b.  3M  3Ui  3mmeM 

^Finding  the  logical  representation  for  (5)  is  left  as  an  exercise  to  the  reader. 
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(12b)  says  that  there  is  a  set  of  male  students  and  a  set  of  undergrads,  and  that  one  male 
student  dated  one  undergrad.  These  truth  conditions  are  much  too  weak  a  representation 
of  (1):  a  NLP  system  using  (12b)  as  the  representation  for  (1)  would  have  to  pay  a  high 
price  to  avoid  computing  all  the  interpretations. 

This  is  not,  however,  what  Kempson  and  Cormack  have  in  mind.  Their  idea  is  that  (12b) 
is  not  the  final  representation  of  (1),  but  only  the  ‘basis’  from  which  the  real  interpretations 
can  be  generated  by  means  of  two  operations: 

•  uniformising:  when  an  existential  quantifier  follows  a  universal,  reverse  their  order 

•  generalising:  turn  an  existential  quantifier  into  a  universal. 

But  in  this  case  we  are  left  with  something  not  much  different  from  a  ‘traditional’  system 
-  the  extraction  operations  do  the  job  that  in  a  traditional  system  would  be  done  by  an 
algorithm  like  Hobbs  and  Shieber’s  (assuming  that  one  can  justify  these  operations  semanti¬ 
cally,  which  Kempson  and  Cormack  don’t)  and  the  ‘filters’  that  Kempson  and  Cormack  use 
to  choose  one  interpretation  over  the  rest  are  not  much  different  from  preference  heuristics. 


2.3  Dependency  Functions 

The  proposal  advanced  by  Hobbs  in  [Hobbs,  1983]  falls  in  a  third  class  of  solutions,  all 
based  on  the  idea  of  representing  scope  relations  as  dependency  functions.  Hobbs’  solution  is 
based  on  a  certain  set  of  assumptions.  First,  Hobbs  wants  to  use  a  first-order  representation, 
with  variables  ranging  over  sets.  Second,  he  represents  determiners  as  relations  between 
two  sets  -  but  the  sets  he  has  in  mind  are  not,  however,  the  set  of  sets  denoted  by  the  NP 
and  the  set  denoted  by  the  VP.  He  instead  paraphrases  a  sentence  like  Most  men  work  as 
‘there  exists  a  set  s  which  represents  a  majority  of  the  set  of  all  men,  and  for  each  individual 
y  in  s,  y  works’.  This  paraphrase  becomes,  in  his  representation,  the  formula  (13). 

(13)  (3s)  (most(s,  {x|man(z)})  A  (Vy)(y  €sD  woRK(y))) 

Hobbs’  third  assumption  is  that  sets  have  typical  elements.  The  typical  element  of  a  set  s 
is  an  individual  r(s)  defined  by  the  following  axiom: 


(14)  (Vs)P5(t(s))  =  (Vy)(j/  €  s  D  P{y)) 


Where  Pg  is  a  predicate  which  is  like  P  except  that  it  is  also  true  of  r(s)  iff  P  is  true  of  all 
the  elements  of  s.  Hobbs’  representation  for  (1)  is  (something  like)  (15),  which  can  be  read 
as  follows:  there  is  a  set  m  which  includes  all  the  male  students,  a  set  u  which  contains  one 
undergrad,  and  the  typical  element  of  m  dates  the  typical  element  of  u. 


(15) 


(3m,mi,U,Ui)  (EVERY(m,mi)  A  A(«,  Ui)  A  MALE-STUDENT^,  (r(mi)) 
A  UNDERGRADE, (r(ui))  A  DATES(r(m), r(w))) 


Finally,  scope  relations  are  represented  using  dependency  functions.  A  dependency  function 
/  returns,  for  each  male  student  x,  the  set  of  undergrads  that  x  dates: 


7 


f{x)  =  {j/|  undergrad(j/)  a  dates(x,  y)}. 


If  the  inferencing  component  discovers  that  there  is  a  different  set  u  for  each  element  of 
the  set  m,  u  can  be  viewed  as  referring  to  the  typical  element  of  this  set  of  sets,  and  the 
fact  u  =  t({/(i)|i  £  m})  can  be  added  to  the  knowledge  base.  There  are  two  problems 
with  this  solution:  First  of  all,  as  Hobbs  points  out,  the  representation  in  (13)  can  only  be 
used  with  monotone  increasing  determiners,  like  most  amd  every.  For  example,  if  we  were 
to  represent  No  man  works  hard  in  Hobbs’s  representation,  we  would  be  able  to  conclude 
that  no  man  works,  which  instead  doesn’t  follow  because  no  is  not  monotone  increasing. 
The  second  problem,  common  to  other  dependency  function- based  solutions,  is  that  only 
sentences  with  two  quantifiers  can  be  given  a  scope-neutral  representation,  and  not,  for 
example,  sentences  with  a  quantifier  and  negation,  such  as  John  doesn’t  have  a  car. 

2.4  Preferences  and  Backtracking 

It  has  often  been  observed  in  the  psycholinguistic  and  syntactic  literature  that  people 
prefer  certain  interpretations  over  others  [Lakolf,  1971;  loup,  1975;  Johnson-Laird,  1977; 
Kempson  and  Cormack,  1981;  Fodor,  1982].  For  example,  most  people  seem  to  agree  that 
Every  male  student  takes  wide  scope  in  (1),  and  the-NP  a  kid  takes  wide  scope  in  (16). 

(16)  A  kid  climbed  every  tree. 

These  preferred  readings  have  been  explained  by  stipulating  the  existence  of  psychologically 
motivated  principles  used  when  parsing  their  sentences.  According  to  the  Linear  Order 
Principle,  for  example,  the  preferred  scope  ordering  of  quantified  phrases  matches  the  left- 
to-right  ordering  of  the  phrases  in  the  sentence.  This  principle  goes  back  to  work  by  Lakoff 
[Lakoff,  1971],  (which  actually  claimed  that  sentences  like  (1)  are  unambiguous  because  of 
this  principle!),  and  would  explain  why  we  get  the  preferred  readings  in  1  and  (16). 

The  stronger  version  of  Lakoff’s  claim  is  clearly  untenable,  because  of  sentences  like 
(17a)  and  (17b): 

(17)  a.  There  wais  a  fish  on  every  plate.  [Chierchia  and  McConnell-Ginet,  1990] 

b.  Every  student  likes  the  professor  of  anthropology. 

loup  [1975]  proposed  however  the  following  revisions  to  Lakoff’s  proposal.  She  replaced 
the  Linear  Order  Principle  with  a  Surface  Subject  principle,  according  to  which  the  surface 
subject  tends  to  take  wide  scope,  especially  if  it  coincides  with  the  deep  subject.  She  also 
introduced  a  second  principle,  known  as  the  Quantifier  Hierarchy  Principle,  according  to 
which  quantifiers  are  organized  in  a  hierarchy  according  to  the  ease  with  which  they  take 
wide  scope.  (For  example,  quantifiers  like  each  or  any  tend  to  take  wider  scope  more  often 
than  some  or  a  few.)  The  first  principle  would  explain  why  the  preference  for  the  subject 
to  have  wide  scope  is  less  pronounced  in  passive  sentences  like  (18);  the  second  principle 
would  explain  why  the  professor  of  anthropology  takes  wide  scope  in  (17b). 

(18)  A  tree  was  climbed  by  every  boy. 
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On  the  basis  of  these  considerations,  one  might  argue  for  the  following  kind  of  architecture: 
instead  of  computing  all  the  interpretations,  the  system  has  the  Left  Order  Principle  “built 
in”,  possibly  corrected  by  the  Quantifier  Hierarchy  Principle  and  generates  only  one  inter¬ 
pretation,  the  one  which  agrees  with  the  principle.  If  inconsistencies  are  found,  the  system 
backtracks,  and  computes  another  interpretation.  To  my  knowledge,  this  architecture  has 
never  been  explicitly  proposed  in  the  literature,  but  it  looks  like  a  potential  solution  to  the 
problems  with  the  ‘traditional’  systems  mentioned  above. 

One  can  however  easily  find  counterexamples  to  each  of  the  principles  proposed  in  the 
literature: 

(19)  a.  Every  school  in  this  district  commemorates  an  episode 
that  occurred  some  years  ago. 

b.  The  cost  of  everything  from  food  to  cars  can  be  pushed 
artificially  high  by  greedy  retailers. 

c.  The  teacher  has  more  influence  than  the  parents  on  most  children. 

(19a)  violates  the  left  order  principle,  {19c)  violates  both  the  left  order  and  the  quantifier 
hierarchy  principle,  and  the  preferred  interpretation  of  (19b),  (for  me  and  a  couple  of 
informants)  is  the  one  in  which  everything  takes  scope  over  greedy  retailers,  which  in  turns 
takes  scope  over  the  cost  of  .. ..  I  doubt  that  anybody  w’iU  have  problems  understanding 
this  sentence,  and  yet  it  violates  not  only  the  left-to-right  ordering  principle  of  Lakoff,  but 
also  the  heuristic  that  NT’s  with  the  determiner  the  tend  to  take  wide  scope. 

Kurtzman  and  MacDonald  run  experiments  to  test  which  of  these  principles  really  affect 
the  choice  of  the  interpretation,  and  how  they  interact  [Kurtzman  and  MacDonald,  1991]. 
Their  result  was  that  in  the  case  of  active  sentences  there  was  good  evidence  in  favor  of 
the  linear  order  principle,  although  this  preference  is  stronger  for  the  “a  ...  every”  order 
than  for  the  “every  ...  a”  order  (in  contrast  with  the  Quantifier  Hierarchy  Principle).  They 
also  seemed  to  find  evidence  for  the  Thematic  Hierarchy  Principle,  according  to  which  the 
NP's  filling  certain  thematic  roles  (and  the  agent  role  above  all)  tend  to  take  wider  scope. 
Kurtzman  and  MacDonald  didn’t  find  any  principle  of  general  applicability,  however;  they 
also  found  that  even  the  Linear  Order  Principle  is  not  used  by  all  speakers.^ 

The  problem  is  that  even  if  we  concede  that  these  principles  actually  exist,  they  certainly 
compete  with  each  other  and  can  be  overridden  by  world  knowledge.  A  sentence  is  often  not 
perceived  as  ambiguous  because  it  appears  in  a  context  which  disambiguates  it.  Consider, 
for  example,  the  sentence  Every  graduate  student  has  to  use  an  office  on  the  4th  floor.  By 
itself,  the  sentence  could  either  mean  that  there  is  more  than  one  office,  or  that  all  graduate 
students  share  the  same  office.  In  the  appropriate  context,  however,  one  or  the  other  of  the 
readings  becomes  preferred: 

.  We  have  problems  with  space  this  year.  Every  graduate  student  has  to  use 
'  ^  an  office  on  the  4th  floor 


^It’s  interesting  to  note  that  Kurtzman  and  McDonald’s  results  do  seem  to  be  in  contr2ist  with  the 
predictions  of  approaches  based  on  using  the  weakest  possible  interpretation,  like  Kempson  and  Cormack’s, 
since  people  do  not  seem  to  favor  the  weakest  interpretation,  and  actu^Llly  at  times  have  a  definite  preference 
for  the  strong  one. 
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Given  these  problems,  is  not  clear  how  the  architecture  based  on  preferences  and  back¬ 
tracking  can  be  made  to  work  without  a  clear  and  declarative  way  of  formalizing  the  way 
these  preferences  work,  that  is,  without  a  logic  which  extends  the  one  I  will  present  in 
this  proposal.  The  interpretation  of  (19b)  generated  by  such  a  system,  for  example,  would 
be  completely  different  from  the  preferred  interpretation.  It’s  unclear  how  such  a  system 
would  be  able  to  use  the  disambiguating  information  given  by  the  context  without  such  a 
formalization;  and  it’s  even  less  clear  how  such  a  system  would  decide  when  to  backtrack, 
and  how  it  would  decide  which  interpretation  to  try  next. 

These  objections  do  not  diminish  the  appeal  of  the  idea  of  using  the  preferences  and 
backtracking.  One  of  the  goals  of  my  future  work  is  to  extend  the  logic  I  will  present  in  this 
paper  in  such  a  way  that  this  kind  of  architecture  can  be  formalized,  and  then  compare  the 
two  approaches. 
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3  A  Crash  Course  on  DRT 


The  version  of  Discourse  Representation  Theory  (DRT)  developed  by  Hans  Kamp  [Kamp, 
1981]  was  originally  meant  to  provide  (i)  a  general  account  of  the  conditional;  (ii)  an  ac¬ 
count  of  the  meaning  of  indefinite  descriptions  and  (ill)  an  account  of  pronominal  anaphora, 
especially  in  “donkey”  sentences,  i.e.,  sentences  like  (21)  and  (22). 


(21)  Every  man  who  owns  a  donkey  beats  it. 

(22)  If  a  man  owns  a  donkey,  he  beats  it. 

Kamp  (and  Heim  [Heim,  1982])  attempt  to  “build  the  discourse  structure  into  the  logic”, 
that  is,  to  relate  the  constraints  on  anaphora  to  the  meaning  of  discourse.  In  the  represen¬ 
tation  defined  in  [Kamp,  1981],  the  traditional  formulas  of  first  order  logic  are  replaced  by 
Discourse  Representation  Structures  (DRS’s),  which  are  pairs  (Af,  C),  where  M  is  a  set  of 
markers  drawn  from  some  set  V,  and  C  a  set  of  conditions.  For  example,  sentence  (23)  is 
represented  in  DRT  by  the  DRS  in  (24). 


(23 )  Pedro  owns  a  donkey. 


(24) 


X  y 

PEDRO(x) 

DONKEY(y) 

owNs(x,y) 


This  DRS  contains  the  two  markers  i  and  y,  and  a  set  of  atomic  conditions  like  DONKEY(y). 
Other,  complex  conditions  composed  of  nested  DRS’s  are  used  to  represent  other  connectives 
and  for  universal  quantification  (see  below.) 

Most  of  the  empirical  import  of  DRT  comes  from  the  definition  of  the  DRS  construction 
rules,  and  above  all  those  for  the  interpretation  of  NP’s  and  of  conditionals.  Two  of  these 
rules  were  used  to  build  the  DRT  representation  of  sentence  (23)  above. 

proper  names  rule:  if  a  is  a  proper  name,  a  new  marker  v  (x  in  the  example  above)  is 
added  to  the  universal  DRS  (that  is,  the  one  not  embedded  in  any  other),  and  a  new 
atomic  condition  of  the  form  a(u)  (pEDRO(x)  in  the  figure)  is  to  the  same  universal 
DRS. 

indefinite  NPs  rule:  if  a  a  is  an  indefinite  NP  (e.g.,  a  donkey)  then  a  new  marker  u  (y  in 
the  previous  example)  is  added  to  the  current  DRS,  and  a  new  atomic  condition  of  the 
form  a(u)  (DONKEY(y)  in  the  example)  is  added  to  the  same  DRS.  I  will  show  shortly 
how  this  rule  has  been  defined  in  this  way  to  explain  why  indefinites  like  a  donkey 
take  an  existential  reading  in  sentences  like  (23),  but  a  universal  one  in  conditional 
sentences  like  (21). 
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The  purpose  of  Kamp’s  move  from  formulas  to  DRS’s  becomes  clear  when  an  actual  discourse 
is  considered.  A  DRT  representation  for  discourse  (25)  is  obtained  by  just  adding  the 
conditions  for  the  second  sentence  to  the  previously  shown  DRS  representing  (23),  as  shown 
in  (26). 


(25)  Pedro  ow'ns  a  donkey.  He  hates  it. 


(26) 


X  y  u  V 

PEDRO(x) 

DONKEY(y) 

owNs(x,y) 

U  =  X 
V  =  y 

HATES(u,v) 


The  conditions  for  it  are  obtained  by  another  DRS  construction  rule,  the  pronoun  construc¬ 
tion  rule: 

pronoun  construction  rule:  If  a  is  a  pronoun,  introduce  a  new  marker  v  to  the  current 
DRS,  choose  a  suitable  marker  u  from  the  currently  accessible  ones,  and  add  to  the 
current  DRS  a  new  condition  v  =  u.  ( A  marker  ti  is  accessible  from  the  DRS  K  if  either 
u  is  local  to  K,  or  is  introduced  into  a  DRS  which  contadns.  K,  as  discussed  below.) 

Every  NP’s  like  every  man  in  the  ‘donkey’  sentence  (21),  are  handled  by  the  following  DRS 
construction  rule: 

every  construction  rule:  If  every  a  /?  is  a  sentence,  add  to  the  current  DRS  a  new  com¬ 
plex  condition  of  the  form  Ki  =»  K2,  where  Ki  and  K2  are  DRS’s,  adding  a  new 
marker  u  to  A'l,  adding  the  conditions  for  a  to  Ki,  and  the  conditions  for  /?  to  A’2. 

The  results  of  applying  this  rxile  to  sentence  (21)  are  shown  in  (27). 


(27) 


X  y 

U  V 

PEDRO(x) 

U  =  X 

DONKEY(y) 

owNs(x,y) 

V  =  y 

HATES(u,v) 

“Accessibility”  in  DRT  is  a  way  of  representing,  in  a  “geometrical”  fashion,  the  constraint  on 
anaphora  known  as  Scope  Constraint,  here  presented  in  a  formulation  due  to  Heim  [1982]: 

Scope  Constraint  (SC);  Do  not  adjoin  an  NP  any  higher  than  to  the  lowest  S  in  which 
it  originates. 
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This  constraint  states  that  no  quantified  expression  can  take  wider  scope  than  the  clause 
in  which  it  originates.  The  SC,  together  with  the  other  usual  constraint  that  no  quantified 
expression  can  serve  as  the  antecedent  of  a  pronoun  outside  the  scope  of  that  expression,  is 
intended  to  explain  why  discourses  like  (28)  are  ungrammatical.  The  scope  of  every  man  is 
limited  to  the  first  sentence,  and  therefore  it  cannot  serve  as  antecedent  of  he. 

(28)  *Every  man  owns  a  donkey.  He  hates  it. 

DRT  introduces  the  “boxes”  to  make  this  constraint  more  apparent.  A  complete  definition 
of  the  accessibility  conditions,  which  also  takes  into  account  complex  DRS’s  as  (27),  is  as 
follows:  a  marker  u  is  accessible  from  the  DRS  K  if  either  u  is  local  to  K,  or  is  introduced  in 
a  DRS  K’  such  that  K'  =»  K,  or  is  introduced  in  a  DRS  K’  which  contains  K.  This  definition 
predicts  that  since  in  the  representation  of  (25)  both  the  markers  x  and  y  are  in  the  same 
DRS  to  which  the  conditions  for  He  hates  it  are  to  be  added,  they  are  accessible,  and  can 
therefore  be  used  as  antecedents  for  He  and  it,  respectively.  To  understand  how  accessibility 
and  the  box  notation  can  be  used  to  explain  the  ungraunmaticality  of  (28),  think  of  how  the 
DRS  in  (27)  could  be  extended  to  represent  discourse  (28):  in  analogy  to  what  was  done 
for  (25),  new  markers  z  and  w  would  be  added  to  the  DRS  for  (21),  but  there  would  be  no 
accessible  marker  to  be  equated  with  them,  since  the  markers  i  and  y  would  be  embedded 
in  a  DRS  not  accessible  from  the  outer  DRS. 

A  model  for  DRT  is  a  structure  (U,  F)  with  universe  U  and  interpretation  function  F. 
F  assigns  an  element  of  U  to  each  proper  name  of  the  language  (in  this  case,  English),  a 
subset  of  U  to  each  of  its  basic  common  nouns  and  basic  intransitive  verbs,  and  a  set  of 
pairs  of  elements  of  U  to  each  basic  transitive  verb. 

Truth  is  again  defined  in  two  stages:  First,  Kamp  introduces  the  notion  of  verification, 
which  is  analogous  to  the  notion  of  satisfaction  in  ordinary  logic.  An  embedding  function  is 
a  function  f  :  V  ^  V ,  where  V*  is  a  subset  of  V.  The  embedding  function  /  verifies  the 
DRS  K,  iff  dom(/)  =  the  set  Vk  of  markers  of  K,  and  for  each  condition  Ci  ^  K,  f  verifies 
C,.  Verification  for  atomic  conditions  is  defined  as  satisfaction  for  atomic  formulas  relative 
to  an  assignment.  If  an  atomic  condition  C  is  of  the  form  a  —  or  q(/?,7),  that  is, 
of  the  form  R{ai, . .  .Qn),  /  verifies  C  iff  (/(oi) . .  ./(a„))  €  F{R). 

The  verification  conditions  for  a  complex  condition  of  the  form  Ki  K2  are  defined 
as  follows.  Let  us  first  extend  the  Vk  notation  used  before  to  embedded  DRS’s:  if  K  is  the 
‘universal’  DRS  Vk  is  the  set  of  markers  ‘introduced’  in  K,  else  Vk  is  the  set  of  markers 
introduced  in  K  together  with  all  the  markers  introduced  in  the  DRS’s  in  K  is  embedded. 
An  embedding  /  verifies  a  condition  of  the  form  A'l  =>  K2  iff  for  every  embedding  g  such 
that  g  verifies  A'l  (which  means  that  dom(j)  =  VX', ,  and  g  verifies  all  the  conditions  in  Ki) 
there  is  an  embedding  h  which  verifies  K2.  It’s  ezisy  to  verify  that  this  definition  gives  to 
the  indefinite  a  donkey  in  (21)  the  desired  universal  force. 

Informally,  a  marker  is  free  in  a  DRS  K  if  it  is  used  in  some  condition  but  not  ‘listed 
at  the  top’.  A  DRS  without  free  markers  K  is  then  defined  to  be  true  wrt  a  model  M  = 
(fJ ,  F)  iff  there  is  some  embedding  function  /  with  values  in  U  such  that  /  verifies  K.  At 
this  point,  one  can  define  logical  truth  and  entailment  as  usual. 
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4  Relational  Semantics 


Ik  this  section  I  will  review  the  main  ideas  of  relational  semantics.  To  make  the  expla- 
natio>n  concrete,  I  will  show  how  one  can  specify  a  relational  semantics  for  a  DRT  represen¬ 
tation.  An  additional  goal  of  this  section  is  to  show  that  DRT  can  be  used  as  a  logic  in  the 
conve?ntional  sense,  that  is,  to  perform  inferences.  I  will  call  the  resulting  logic  DRTq. 

4.1  Meanings  as  Relations 

Relational  Semantics  was  originally  developed  to  model  explicitly  the  kinds  of  con¬ 
straints  that  NP’s  impose  on  discourse.  These  constraints  can  be  expressed  in  terms  of  the 
requirements  on  variable  assignments.  For  example,  after  the  sentence  A  farmer  with  a 
donktey  beat  it  has  been  added  to  a  discourse,  every  variable  assignment  /  which  satisfies 
the  mew  discourse  must  assign  to  the  variable  used  to  represent  the  NP  a  farmer  a  value  not 
used  before.  After  a  sentence  like  He  sat  under  the  table,  instead,  every  variable  assignment 
which  satisfies  the  discourse  must  give  to  the  variable  used  to  represent  He  the  same  value 
assigned  to  some  other  variable. 

A  comparison  with  standard  first  order  logic  may  help.  Consider  the  sentence  A  farmer 
with  la  donkey  beat  it,  and  assume  that  the  coindexing  relations  are  those  represented  by 
the  indices  in  (29). 

(29)  [A  farmeri  with  a  donkeyj  beat  it2] 

Sentence  (29)  is  represented  in  first  order  logic  by  (30). 

(30)  3y  [farmer'(x)  A  DONKEY'(y)  A  with'(x,  y)  A  Beat'(x,  y) 

Instead  of  saying  that  (30)  is  true  with  respect  to  a  model,  M,  and  a  variable  assignment, 
f,  we  can  say  that  the  meaning  of  (30)  in  a  model  M  is  the  set  of  assignments  with  values 
in  M  which  satisfy  it: 

(31)  II  [A  farmeri  with  a  donkey2  beat  it2]  ||*^  ={  /  I  /  satisfies  . . .} 

Relational  Semantics  takes  this  one  step  further.  The  crucial  idea  is  that  the  constraints 
imposed  on  a  discourse  by  the  NP’s  can  be  modeled  most  effectively  by  using  partial  vari¬ 
able  assignments  and  by  requiring  that  each  variable  assignment  which  satisfies  the  whole 
discourse  be  an  extension  of  a  variable  assignment  which  satisfies  the  portion  of  discourse 
prior  to  the  last  sentence.  Symbolically,  a  sentence  like  (29)  will  cause  an  extension  in  the 
assignment,  as  shown  in  (32): 

(32)  /  [A  farmer]  with  a  donkey2  beat  it2]  /' 

This  can  be  represented  by  requiring  the  value  of  (29)  to  be  not  a  set  of  assignments  as  in 
(31),  but  a  relation,  that  is,  a  set  of  pairs  of  assignments,  where  the  first  element  of  each  pair 
is  a  variable  aissignment  which  satisfies  the  discourse  prior  to  (29),  and  the  second  element  is 
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a  variable  assignment  which  satisfies  the  discourse  after  (29)  has  been  added  [Barwise,  1987; 
Rooth,  1987]: 

/oox  {(/>/")!  3y3x[FARMER'(a:)  ADONKEY'(i/)A  WlTH'(l,J/)ABEAT'(x,y) 

A/"  =  n  D 

This  idea  of  ‘meaning  as  relations’  is  also  used  in  Situation  Semantics  [Barwise  and  Perry, 
1983]  and  it  is  becoming  increasingly  popular  because  it  is  very  useful  to  capture  certain 
properties  of  anaphoric  relations  [Schubert  and  Pelletier,  1988;  Groenendijk  and  Stokhof, 
1990]. 


4.2  A  Relational  Semantics  for  DRT 
The  Syntax  of  DRTq 

The  set  of  symbols  of  DRTq  includes  a  set  of  property  symbols  (unary  predicates),  a 

set  of  relational  symbols,  and  a  set  of  markers;  xo,  .  •  • ,  . (I  will  sometimes  use 

for  simplicity  letters  without  indices  like  x,  y,  etc.  for  the  markers.)  The  set  of  expressions 
of  DRTq  consists  of; 

1.  marker  introducers  like  where  Xj  is  a  new  marker,  a  marker  not  used  for 
any  other  marker  introducer,  (said  otherwise,  i  roust  be  strictly  greater  than  any 
previously  used  marker  index.) 

2.  conditions: 

(a)  unary  conditions  like  P{xi),  where  Xj  is  a  marker  and  F  is  a  property  symbol. 

(b)  binary  conditions  like  i?(x,,  xj),  where  x,  and  xj  are  markers  and  F  is  a  relation 
symbol. 

(c)  coindexing  conditions  like  Xi  =  xj,  where  x,  and  xj  are  markers. 

(d)  negated  DRSs  of  the  form  -<K,  where  K  is  a  DRS. 

(e)  conditional  DRSs  of  the  form  Ki  — ►  K2,  where  A'l  and  A'j  are  DRSs. 

3.  Discourse  Representation  Structures:  a  DRS  is  an  expression  containing  one 
or  more  conditions  and  zero  or  more  marker  introducers,  usually  written  as  in  (34), 
where  o^®  and  are  marker  introducers,  FARMER(zo)  and  donkey(xi)  are  unary 
conditions,  and  owNs(xo,xi)  is  a  binary  condition. 


(34) 


Q,X0  QjXl 

FARMER(xo) 
DONKEY(xi) 
OWNS  (xo.Xi) 
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In  keeping  with  the  standard  conventions,  I  will  reserve  the  symbol  K,  possibly  with  sub¬ 
scripts,  to  indicate  DRS's.  Subscripted  x’s  like  lo  will  always  indicate  markers.  Let  a  marker 
I  be  free  in  K  if  no  marker  introducer  a*  is  in  K.  A  DRTo  formula  is  a  DRS  with  no  free 
markers.  The  donkey  sentence  Every  farmer  who  owns  a  donkey  beats  it  is  represented  in 
DRTo  by  (35). 


The  only  significant  difference  between  DRTo  and  ^standard'  DRT  is  the  distinction  between 
‘use’  and  ‘introduction’  of  markers.  This  distinction  makes  it  easier  to  enforce  the  constraint 
that  each  marker  has  to  be  new,  as  well  as  simplifying  the  definition  of  the  semantics  of  a 
DRS,  but  otherwise  has  no  semantic  consequences. 

The  Semantics 

A  model  M  for  DRTo  is  a  pair  (1%  F):  U  is  &  nonempty  set,  and  F  an  interpretation 
function.  Assignments  are  called  embedding  functions  in  DRT;  embedding  functions  are 
partial  functions  from  markers  to  objects  of  the  domain.  An  embedding  function  over  M  is 
a  function  which  associates  to  the  markers  values  from  U.  The  denotation  with  respect  to 
M  of  an  expression  of  DRTo  is  a  set  of  pairs  of  embeddings  over  M  defined  as  follows: 

1.  110*^11^  =  {{f,9)\  f  Q  9,  Xi  ^  dom(/)  and  9  =  f  U  (tj,  a),  for  some  a  e  U] 

2.  ||farmer(zOII^  =  f)\  fi^i)  €  F(farmer)} 

3.  ||OWNS(li,Ij)||^  =  (fix,),  f{Xj))  €  F(owns)} 

4.  ||x,  =  x,r  =  {(f,/)| /(!,)  = /(z^)} 

ll^  =  {(f. /)l  there  exist 

(fn-ufn)  e  ||a"-||^and  (/„,/„)  G  ||Ci|r, . .  .HC^nD 


6.  ihKW^  =  w,m,f)^\\K\n 

7.  ||A'i  — ►  ^2!!^  =  {(/",  f)\  for  all  /  s.t.  for  all  extensions  9  s.t.  (/",  ff)  G  ||A’i||^,  there 
exists  h  s.t.  (7,  h)  G  11^211^} 

It  is  easy  to  check  that  the  verification  conditions  in  DRTo  &re  analogous  to  those  of  standard 
DRT,  and  to  verify  that  by  requiring  that  a  marker  x  can  only  be  coindexed  with  a  marker  y 
if  the  assignment  x  is  defined  on  both,  one  also  obtains  the  same  accessibility  conditions  of 
DRT.  Truth  can  be  defined  as  foUows.  A  formula  K  is  true  in  a  model  M  iff  ||A'||^  ^  0.  A 
simple  notion  of  entailment  for  DRTo  can  be  defined  as  follows:  if  Ki  and  A2  are  formulas, 
A'l  ^  A'2  iff  for  all  models  M  in  which  A'l  is  true,  ||A'i||^  C  ||A'2||^- 
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The  Inference  Rules 

I  am  not  aware  of  any  definition  of  inference  rules  for  DRT  in  the  literature,  so  1  will 
introduce  one  that  will  do  for  the  purposes  of  this  article.  A  rule  of  inference  in  DRTo  is  a 
way  of  deriving  a  conclusion  from  a  set  of  premises,  precisely  as  in  first  order  logic.  That 
it,  the  rules  of  inference  of  DRT  are  of  the  form 

Ai, . . .,  An 

K 

where  both  the  premises  K\,...,Kn  and  the  conclusion  K  are  conditions.  The  only  dif¬ 
ference  is  that  these  inference  rules  w-ill  be  DRS-specific,  in  the  sense  that  the  argument  is 
applicable  only  when  the  premises  Ki,...,  A’„  a^e  all  conditions  of  a  single  formula  K'\  the 
conclusion  will  also  be  added  to  the  same  DRS,  obtaining  a  new  DRS  K".  An  inference  rule 
is  acceptable  iff  K"  is  still  a  formula.  A  rule  of  inference  wiU  be  sound  iff  it  is  acceptable, 
and  K'  A'".  An  example  of  sound  rule  of  inference  for  DRTo  is  the  following  version  of 
Modus  Ponens: 

0  — 0.  P 
0 
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5  A  Relational  Semantics  for  Unscoped  Logical  Forms 

5.1  A  Non-disambiguated  Representation 

DRTo  is  not  a  solution  to  our  problems:  the  two  interpretations  of  sentence  (1),  in  fact, 
still  have  to  be  represented  by  distinct  DRTo  formulas: 


a. 


(36) 

b. 


Because  of  the  way  the  semantics  has  been  defined  in  section  4.2,  however,  it  is  relatively 
easy  to  extend  DRTo  with  a  new  construct  which  can  be  used  to  give  a  unique  representation 
to  (1).  In  this  section  I  will  introduce  a  model  of  disambiguation  in  which  sentences  are 
represented  by  scope  forests  whose  denotation  is  the  union  of  the  denotations  of  the 
scopally  disambiguated  interpretations,  and  the  number  of  possible  interpretations  can  be 
restricted  using  inference  rules  which  reflect  either  logical  or  referential  facts. 


ay 

UNDERGRAD(y) 

a' 

male-student(x) 

- >  DATES(x,y) 

5.2  Scope  Forests 


Consider  first  a  slightly  modified  version  of  (1). 
(37)  Every  male  student  dates  most  undergrads. 


The  ‘unscoped  logical  form’  representation  of  (37)  proposed  in  the  NLP  literature  [Schubert 
and  Pelletier,  1982;  Allen,  1987;  Alshawi  and  van  Eijck,  1989]  can  be  rendered  in  DRT  by 
something  like  (38). 


(38) 


My  goal  is  to  give  a  denotation  to  a  DRS  of  this  type,  and  I  can  use  for  this  purpose  the 
relational  semantics  presented  in  section  4.2,  as  follows.  There  are  two  quantified  NP’s  in 
the  sentence,  every  male  student  and  most  undergrads,  and  two  ways  of  ‘ordering’  them  to 
get  an  interpretation.  The  interpretation  of  (37)  in  which  the  universal  takes  wide  scope 
(assuming  a  representation  for  generalized  quantifiers  in  DRT  roughly  analogous  to  the  one 
Kamp  suggests  in  [1988])  is  shown  in  (39). 


qS 

most] 

UNDERGRAD(y) 

DATES(x,y) 

(39) 


male-student(x) 


every, 
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If  we  think  of  the  restrictions  of  the  quantifiers  and  of  the  DRS  which  represents  the  scope  of 
most  undergrads  in  (39)  as  nodes  of  a  graph,  we  can  see  each  way  of  ordering  the  quantifiers 
as  a  path  which  starts  from  the  antecedent  of  the  quantifier  with  wider  scope  and  ends  with 
the  consequent  of  the  quantifier  with  narrower  scope.  I  will  therefore  call  each  such  way  of 
ordering  the  NP's  a  path.  In  the  semantics  of  section  4.2,  each  path  denotes  a  set  of  pairs. 
The  denotation  of  the  logical  form  (38)  can  therefore  be  defined  as  the  union  of  these  sets. 
I  will  call  DRS’s  like  (38)  scope  forests^.  If  the  denotation  of  a  scope  forest  sf  is  the  union 
of  the  denotations  of  the  paths  in  a  set  ps,  I  will  say  that  the  paths  in  ps  are  associated 
with  sf.  The  translation  rules  in  the  grammar  are  such  that  the  interpretation  of  a  sentence 
like  (37)  is  a  scope  forest®.  I  will  call  the  resulting  logic  DRTi. 

Ordering  Constraints 

As  new  facts  about  the  relative  scope  of  the  NP’s  in  a  sentence  s  are  discovered,  the 
number  of  paths  (that  is,  interpretations)  associated  with  the  scope  forest  s/ representing 
s  gets  smaller.  The  inference  rules  for  scope  disambiguation  presented  below  model  this 
process.  These  rules  use  logical  truths  and  facts  about  reference  to  derive  from  a  scope  forest 
sf  a  new  scope  forest  sf  which  has  more  ordering  constraints.  An  ordering  constraint 
of  a  scope  forest  sf  is  a  label  of  the  form  i  <  j,  where  i  and  j  are  indices  of  operators  in 
sf  Only  paths  in  which  the  operator  with  index  i  precedes  the  operator  with  index  j  are 
associated  to  a  scope  forest  s/ labeled  with  the  ordering  constraint  i  <  j.  For  example,  the 
scope  forest  equivalent  to  the  subset  of  interpretations  of  (37)  in  which  most  undergrads 
takes  scope  over  every  male  student  is  represented  by  the  DRS  in  (40). 


(40) 


{2<1} 


Note  that  only  one  path  is  associated  to  the  scope  forest  (40),  which  means  that  the  inference 
rules  for  disambiguation  defined  below  would  apply. 


Negation  and  Indefinites 


Before  introducing  the  rules  of  inference  I  need  to  fill  in  a  few  details.  The  first  question  is 
how  to  represent  the  ambiguities  of  scope  originated  by  operators  like  negation.  The  answer 
is  that  the  tools  introduced  so  far  are  sufficient  to  represent  scope  ambiguities  originated 
by  negation,  provided  that  we  also  index  the  negation  operator.  The  representation  of  the 
sentence  John  doesn’t  have  a  car,  for  example,  will  be  the  scope  forest  (41).  This  method 
can  also  be  used  for  modal  operators. 


(41) 


V? 


j  -<iHAVE  <  a2 


car(x) 


0 


*This  name  has  historical  reasons.  ‘Scope  maze’  or  something  similar  would  probably  be  more  appropriate 
assume  that  the  logical  form  is  generated  as  proposed  by  Schubert  and  Pelletier  [1982]. 
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The  second  question  is  how  to  nnake  the  ‘path’  idea  work  with  sentences  like  (1),  since  the 
representation  for  indefinites  in  DRTo  does  not  consist  of  a  restriction  and  a  scope.  My 
answer  is  that  it  is  possible  to  represent  indefinite  NP’s  with  structures  similar  to  those 
used  for  quantifiers  without  changing  the  properties  that  indefinites  have  in  standard  DRT. 
It  is  possible,  for  example,  to  represent  the  disambiguated  reading  of  (1)  in  which  every 
student  scopes  over  an  undergrad  as  in  (42). 

(42) 

Two  properties  of  indefinites  have  to  be  preserved.  The  first  property  is  that  indefinites, 
unlike  quantifiers,  are  not  subject  to  the  scope  constraint,  as  shown  by  the  contrast  in 
acceptability  between  (43a)  and  (43b): 

,  .  a.  A  dogi  came  in.  Itj  sat  under  the  table 
'  ^  b.  Every  dog,  came  in.  ??It,  sat  under  the  table 

It  has  been  shown  however  that  we  can  model  this  distinction  semantically,  and  still  repre¬ 
sent  indefinites  with  structures  like  (42)  [Rooth,  1987;  Schubert  and  Pelletier,  1988].  We 
can  do  this  by  separating  the  class  of  referential  DRS’s®  used  to  represent  determiners 
like  a  and  the,  and  the  pronouns,  from  the  class  of  quantified  DRS’s,  used  to  represent 
determiners  like  most  and  every.  Both  classes  of  DRS’s  will  have  a  restriction  and  a  scope, 
but  they  will  have  different  semantic  properties:  in  particular,  indefinite  NP’s  will  have  the 
same  accessibility  properties  that  they  have  in  DRTo. 

A  second  reason  for  not  treating  indefinites  as  quantifiers  is  that  unselective  operators 
like  the  universal  and  the  conditional  seem  able  to  bind  indefinites,  but  not  generalized 
quantifiers.  Again,  this  does  not  prevent  using  a  representation  like  (42)  in  which  indefinites 
have  a  restriction  and  a  scope.  If  desired^,  one  can  achieve  the  same  semantic  effects  of 
DRT  by  giving  to  generalized  quantifiers  the  capability  of  imposing  constraints  on  the  set 
of  verifying  embeddings,  as  shown  in  more  detail  in  section  6. 

In  a  word,  representing  indefinite  NP’s  as  in  (42)  doesn’t  imply  that  they  get  different 
anaphoric  properties,  nor  that  generalized  quantifiers  have  different  properties  in  DRTi  than 
they  have  in  DRTo,  unless  this  change  is  otherwise  motivated.  (If  desired,  we  could  even 
define  ‘simplification  rules’  for  transforming  the  structures  associated  to  referential  NP’s  into 
the  representation  more  traditionally  associated  to  indefinites  in  DRT.)  We  can  therefore 
use  the  scope  forest  notation  for  (1)  as  well.  The  scope  forest  into  which  (1)  is  translated 
will  be  as  follows: 

(44) 

*Bad  as  this  name  sounds,  the  only  alternative  that  came  io  my  mind  was  ‘article  DRS’s’,  which  is  even 
more  misleading,  because  referential  DRS’s  are  also  used  to  represent  pronouns. 

^That  is,  leaving  aside  the  weU-known  objections  raised  against  the  unselective  quantification  account: 
the  proportion  problem,  presented  in  more  detail  in  section  6,  puts  under  discussion  the  claim  for  generalized 
quantifiers,  while  Schubert  and  Pelletier  [Schubert  and  Pelletier,  1988]  present  counterexamples  to  the  claim 
for  conditionals. 
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A  remark  on  the  notation;  in  order  to  distinguish  between  referential  NP’s  and  quantifiers  in 
the  scope  forest  representation,  I  will  use  parentheses  instead  of  angle  brackets  for  referential 
NP’s;  brackets  will  be  used  when  any  deternainer  is  possible. 


Plural  Anaphora  to  Quantifiers 

While  intersentential  singular  anaphora  to  every-NP’s  and  indefinites  in  the  scope  of  a 
quantifier  is  subject  to  a  number  of  restrictions®,  intersentential  plural  anaphora  is  generally 
possible,  as  shown  by  the  contrast  between  (43b)  and  (45),  as  well  as  by  the  contrast  between 
(46a)  and  (46b). 

(45)  Every  dog,  came  in.  They,  sat  under  the  table. 

a.  Every  person  with  a  dog,  came  in.  ??It,  was  put  under  the  table. 

'  b.  Every  person  with  a  dog,  came  in.  They^  were  put  under  the  table. 

Knowing  that  they  and  a  dog  are  anaphorically  related  in  (46b)  is  useful  to  disambiguate. 
I  wiU  therefore  introduce  a  formalization  of  the  facts  about  plural  anaphora  to  singular 
NP’s  that,  without  giving  an  explanation  of  the  phenomenon,  will  make  it  possible  to  use 
these  facts  in  the  inference  rules  for  disambiguation.  I’ll  borrow  the  necessary  notation  from 
Link’s  LP  logic  [Link,  1987].  In  a  model  for  LP,  the  universe  of  discourse  is  not  a  set,  but 
a  complete  semilattice  (E,  V)  which  contains  all  the  ‘sums’  of  the  (atomic)  individuals  of  a 
set  A  C  £.  An  embedding  defined  over  such  model  can  assign  to  a  marker  either  an  atomic 
individual  in  A,  or  an  element  of  E- A,  I  can  then  introduce  the  logical  predicates  ATOM(ar), 
true  iff  the  value  associated  to  x  in  Link’s  model  is  an  element  of  A,  and  GROUP(x),  true  iff 
that  value  is  in  E  —  A.®  An  important  property  of  atom(x)  and  group(i)  is  summarized 
by  the  following  lemma; 

Lemma  5.1  For  every  marker  x  it  is  either  the  case  that  atom(i)  or  that  group(x),  but 
not  both. 

The  semantics  of  quantified  DRS’s  like  Ki  will  be  defined  as  the  result  of  an 

operation  called  distancing  applied  to  the  set  {{f,  g)}  of  pairs  of  embeddings  such  that  / 
verifies  the  truth  conditions  of  Ki  and  g  is  one  of  the  embeddings  which  extend  / 

by  giving  values  to  the  markers  in  Ki  which  verify  both  K\  and  K2.  Embeddings 

are  ways  of  encoding  situations,  and  distancing  can  be  understood  in  terms  of  situations, 
as  a  ‘change  in  perspective’;  in  the  situations  encoded  by  the  embeddings  produced  from 
distancing  we  do  not  perceive  any  more  the  individual  events  and  single  objects,  but  only 
the  situation  in  its  totality  and  the  sets  of  objects  involved.  After  distancing,  only  the 
projections  of  the  NP’s,  that  is,  the  sums  of  objects  playing  certain  roles  in  the  global 
situation,  are  available  for  discourse  anaphora.  This  is  formalized  in  terms  of  embeddings 

'Roberts  discusses  some  cases  in  which  it  is  possible  [Roberts,  1987];  see  also  [Poesio  and  Zucchi,  1991]. 

'Link’s  model  has  already  been  used  by  Kamp  to  represent  conjoined  and  plural  NP’s  in  DRT  [Kamp, 
1988].  Both  ATOM  and  GROUP  are  mine. 
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by  having  distancing  return,  when  applied  to  a  set  {{f,  g)}  of  pairs  of  embeddings,  a  new 
set  {{f,  h)}  such  that  h  gives  to  all  markers  introduced  in  Ki  a  value  that  is  the 

sum  of  the  values  given  to  them  by  all  the  extensions  of  /  in  the  input  set.  For  example, 
the  denotation  of  the  quantified  DR5  in  (42)  will  be  a  set  {(f,  h)}  such  that  for  every  pair 
(f,  /i),  h{x)  is  the  sum  of  all  male  students  which  date  an  undergrad,  and  h{y)  is  the  sum 
of  all  undergrads  which  are  dated  by  a  male  student.  h(x)  is  the  projection  of  every  male 
student,  and  h{y)  is  the  projection  of  an  undergrad. 

According  to  this  account,  the  contrast  between  (43a)  and  (43b),  as  well  as  the  accept¬ 
ability  of  (45),  are  due  to  the  fact  that  the  projection  of  a  dog  in  (43a)  is  a  unique  individual, 
and  therefore  available  for  individual  anaphora,  while  the  projection  of  every  dog  in  (43b) 
and  in  (45)  is  the  set  of  all  dogs^°. 


5.3  Reasoning  With  Scope  Forests 


What  can  we  do,  then,  with  a  logic  with  scope  forests?  First  of  all,  we  can  do  every¬ 
thing  that  we  can  do  with  first  order  logic,  since  the  normal  inference  rules  (like  modus 
ponens  and  resolution)  are  still  sound.  Second,  we  can  infer  certain  consequences  without 
disambiguating.  Third,  we  can  use  the  information  in  the  discourse  to  disambiguate. 


Reasoning  without  Disambiguating 


How  can  we  do  inference  without  disambiguating?  In  order  to  do  that,  we  need  to  define 
inference  rules  analogous  to  first  order  logic’s  Universal  Instantiation  (UI)  and  Existential 
Generalization  (EG).  It’s  easy  to  see  how  such  rules  can  be  defined  (and  semantically  jus¬ 
tified)  in  the  framework  I  have  been  proposing.  I  will  give  as  an  example  the  scope  forest 
version  of  Universal  Instantiation;  ‘weak’  versions  of  Existential  Generalization  and  Exis¬ 
tential  Instantiation  can  be  defined  in  the  same  way. 


’°This  account  provides  a  justification  for  the  operation  of  summation  introduced  in  [Kamp,  1988],  but 
of  course  doesn’t  solve  the  well-known  problems  raised  by  (47a)  and  (47b). 


(47)  a.  Each  student  walked  to  the  stage.  He  shook  hands  with  the  dean  and  left.  (Partee)  . 

b.  Each  Italian  loves  his  car.  He  rides  it  every  Sunday. 


In .  [Poesio  and  Zucchi,  1991]  we  propose  that  distancing  is  blocked  if  the  discourse  has  a  certain  structure 
-  for  example,  the  sentence  which  contains  the  anaphoric  reference  is  the  continuation  of  an  episode  along  a 
known  ‘script’. 
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WUI  (Weak  Universal  Instantiation)  :  from  the  sentences  Every  male  student  dates 
an  undergrad  and  John  is  a  male  student  conclude  John  dates  an  undergrad. 


In  this  rule,  as  in  the  rules  which  I  will  present  in  the  next  section,  OC  is  a  set  of  ordering 
constraints,  and  the  format  is  that  specified  in  section  4.2.  Similar  rules  hold  when  a  is 
replaced  by  many,  most,  etc. 


Inference  Rules  for  Scope  Disambiguation 

We  can  derive  information  about  the  intended  scope  relations  comes  from  a  variety  of 
sources.  Three  kinds  of  sources  seem  especially  important: 

1.  Logical  facts,  like  the  fact  that  the  sentence  A  male  student  is  dating  an  undergrad 
has  only  one  intepretation. 

2.  Anaphoric  facts:  K  sentence  (1),  Every  male  student  dates  an  undergrad,  is  followed  by 
They  meet  them  at  parties,  and  we  may  conclude  that  either  them  or  they  is  anaphoric 
to  an  undergrad  in  (1),  that  is,  w'e  may  conclude  that  the  projection  of  an  undergrad 
in  (1)  is  not  a  single  person,  but  a  group  of  people,  then  we  may  also  conclude  that 
every  male  graduate  scopes  over  an  undergrad. 

3.  World  knowledge.  For  example,  one  may  use  facts  about  the  social  rules  of  dating 
to  infer  that  the  most  likely  interpretation  of  (1)  is  the  one  in  which  every  student 
takes  scope  over  an  undergrad.  As  this  very  example  shows,  however,  most  of  this 
information  cannot  be  taken  as  conclusive,  and  therefore  rules  of  this  type  are  only- 
appropriate  with  a  logic  which  allows  for  revisions. 


I  will  present  an  example  of  scope  forest  reduction  rule  based  on  ‘logical’  facts  and  two 
examples  of  rules  based  on  ‘anaphoric’  facts.  (There  is  no  pretense  of  completeness:  the 
only  reason  why  I  give  these  specific  rules  is  that  they  will  get  the  examples  in  section  5.4 
through.)  I’ll  then  present  the  rule  to  for  deriving  a  disambiguated  DRS  from  a  scope  forest. 
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ROR  (Referential  Over  Referential)  :  This  rule  reflects  the  logical  fact  that  referen¬ 
tial  NP’s  do  not  create  scope  ambiguities:  in  A  man  saw  a  dog^  for  example,  the 
relative  scope  of  A  man  and  a  dog  does  not  matter. 


RAOQ  (Referential  Atom  Over  Quantifier)  :  This  rule  allows  the  reduction  of  the 
scope  forest  associated  with  sentences  like  Every  male  student  dates  an  undergrad 
once  it  has  been  concluded  that  an  undergrad  refers  to  a  single  individual,  that  is,  the 
projection  of  an  undergrad  is  an  atom  in  Link’s  sense.  It  is  worth  remembering  that 
because  of  distancing,  y  has  different  values  ‘inside’  and  ‘outside’  of  a  quantified  DRS 
(therefore,  of  a  scope  forest):  ‘outside’  it  denotes  the  projection  of  the  referential  NP. 


QORG  (Quantifier  Over  Referential  Group)  :  This  rule  enables  us  to  conclude,  from 
the  fact  that  the  projection  of  an  indefinite  NP  is  a  group  and  the  indefinite  NP  is  in 
a  scope  forest  with  a  quantifier,  that  the  quantifier  takes  wide  scope.  (Consider  for 
example  the  case  when  (1)  is  followed  by  They  meet  them  at  parties.) 


Last  but  not  least,  we  need  to  be  able  to  derive  a  disambiguated  DRS  from  a  scope  forest 
associated  to  a  single  path.  The  simplest  way  for  doing  this  is  to  introduce  a  rule  of  inference 
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whose  premise  is  a  scope  forest  associated  to  a  single  path,  as  follows: 

SFE  (Scope  Forest  Elimination)  :  a  scope  forest  which  associated  to  a  single  path  can 
be  replaced  by  the  corresponding  interpretation. 


p(i) 


ay 

Q(y) 


With  this  way  of  writing  the  rules  it  will  be  simple  to  show  how  the  derivations  work  in 
section  5.4,  but,  of  course,  one  would  then  need  one  such  rule  for  each  permutation  of  the 
indices  -  120  rules  for  a  sentence  with  5  quantifiers,  for  example.  Plus,  one  would  require  one 
such  rule  for  every  number  of  arguments.  In  a  word,  it  would  seem  that  the  combinatorial 
explosion  that  I  was  throwing  out  of  the  door  is  coming  back  through  the  window. 

It  is  not  so  in  practice,  however.  First  of  all,  the  particular  notation  for  scope  forests  I 
have  been  using  has  been  chosen  to  preserve  the  similarity  with  the  logical  forms  proposed 
in  the  NLP  literature  as  much  as  possible.  With  this  notation,  however,  one  can  only  write 
rules  which  apply  to  scope  forests  with  a  fixed  number  of  arguments,  and  these  rules  are 
normally  asymmetrical,  in  the  sense  that  the  argument  position  is  significant.  It  should  be 
clear  however  how  rules  for  n-arguments  scope  forests  could  be  written,  as  well  as  rules  in 
which  the  argument  position  is  reversed.  There  are  notations  in  which  more  general  rules 
can  be  written,  but  I  preferred  not  to  use  them,  since  they  are  pretty  opaque. 

As  for  the  potentially  more  dangerous  problem  of  requiring  n!  rules  to  disambiguate  a 
scope  forest  with  n  arguments,  a  moment’s  thought  will  reveal  that  we  don’t  really  need 
those  many  rules:  all  that  is  really  needed  is  one  rule  which  is  triggered  when  the  ordering 
constraints  define  a  total  order  (that  is,  when  there  is  a  path  of  length  n  —  \  from  an  index  to 
another  index),  and  that  ‘extracts’  the  operators  one  after  the  other  from  the  scope  forest. 
While  it  is  difficult  to  present  such  a  rule  in  a  simple  way,  it  is  not  difficult  to  develop  an 
inference  procedure  which  does  this  without  really  going  through  n!  rules.  I  have  a  very 
simple  minded  algorithm  which  can  add  a  new  constraint  and  discover  if  the  constraints 
define  a  total  order  in  O(n^).  Once  it  has  been  determined  that  the  constraints  define  a 
total  order,  the  operators  can  be  ‘extracted’  from  the  scope  forest  one  after  the  other  in 
linear  time  using  a  procedure  similar  to  the  one  that  will  be  used  in  section  6  to  define  the 
set  of  paths  associated  to  a  scope  forest. 

5.4  Reasoning  with  Scope  Forests:  Examples 

In  this  section  I  will  show  by  example  what  one  can  do  with  DRTi.  The  two  examples 
will  show  how  one  can  formally  derive  an  unambiguous  interpretation  from  a  scope  forest 
using  the  inference  rules  presented  in  the  previous  section.  In  this  way,  I  will  be  able 
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to  take  advantage  of  the  examples  to  show  how  scope  disambiguation  can  interact  with 
other  discourse  disambiguation  processes,  namely,  reference  disambiguation.  I  will  not  give 
explicit  examples  of  application  of  WUI;  however,  the  derivations  presented  below  should  be 
explicit  enough  that  the  reader  should  not  have  problems  in  reconstructing  the  derivation 
of,  e.g.,  John  dates  an  undergrade  from  the  scope  forest  representation  of  Every  male  student 
dates  an  undergrad  and  of  John  is  a  male  student.  I  will  first  present  a  simple  model  of 
reference  disambiguation,  then  show  the  derivations. 

An  Elementary  Model  of  Reference  Disambiguation 

The  set  of  possible  anaphoric  antecedents  of  a  pronoun  x  (its  anchoring  set)  initially 
includes  all  the  markers  accessible  to  x  according  to  a  definition  of  accessibility  which  is 
essentially  that  of  DRT,  modulo  the  accessibility  of  plural  markers  for  quantifiers.  The  initial 
anchoring  set  does  not  include  those  markers  ruled  out  by  binding  constraints  [Reinhart, 
1983],  since  the  parser  introduces  a  disjointness  condition  z  ^  y  for  each  such  marker  y. 

1  will  need  a  logical  predicate  for  talking  about  accessibility  in  the  object  language. 
The  relation  between  two  markers  z,  -<  Xj  holds  whenever  z,  is  accessible  from  Zj,  that 
is,  whenever  z,  is  introduced  ‘before’  Xj.  Semantically,  z,  -<  Zj  will  be  defined  to  hold 
whenever  i  <  j  (remember  that  each  new  marker  has  a  greater  index  than  any  of  the 
markers  introduced  before,  and  that  no  two  markers  are  allowed  to  have  the  same  index) 
and  the  embedding  gives  values  to  both  Zj  and  Xj. 

I  will  also  make  the  simplifying  assumption  that  if  the  marker  z  is  the  representation 
of  a  pronoun,  then  z  is  coindexed  with  one  of  the  accessible  markers.  This  assumption  is 
encoded  by  the  following  axiom* 

Axiom  5.1  Let  zi . .  .z„  be  all  the  markers  for  which  Xj  ■<  Zn+i  is  true,  and  let  z„4.i  be 
introduced  by  a  pronoun.  Then  z„+i  =  Zj  V  . .  •  V  z„+i  =  z„  is  also  true. 

The  first  reference  disambiguation  rule  adds  new  disjointness  conditions,  thus  eliminating 
elements  from  the  anchoring  set.  Let  the  *  operator  be  defined  as  follows: 


) 

The  ASTR  rule  says  that  if  two  markers  z  and  y  are  not  of  the  same  type,  that  is,  a  predicate 
p  is  true  of  the  marker  z  but  not  of  y,  then  y  and  z  are  disjoint. 

ASTR  (Type  Reduction)  : 

P‘(z),  ^P*(y) 

*  #  y 

*'Thi8  axiom  is  too  strong  in  general,  but  will  do  for  the  moment.  Consider  the  sentence  He  came  in,  and 
suppose  that  John  and  Bill  are  the  only  available  referents.  Using  the  disjunction  method,  we  would  obtain 
as  a  representation  of  the  sentence  that  it  is  either  the  case  that  John  came  in  or  that  Bill  came  in.  Imagine, 
however,  that  neither  John  nor  Bill  came  in.  The  theory  would  then  predict  that  He  came  in  is  false,  which 
doesn’t  seem  the  right  prediction:  most  people  would  conclude  instead  that  the  referent  of  he  is  neither  John 
nor  Bill.  This  is,  I  think,  yet  another  argument  against  using  identifying  ambiguity  with  disjunction. 


(*OP)  P*(z)  =def  (ATOM(z  )  A  p(z))  V  (group(z)  a 


U-' 

V  ^  T. 


every, 


p(y) 
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When  the  anchoring  set  of  x  consists  of  a  single  element  z  the  parameter  anchoring 
(PA)  rule  applies.  Parameter  anchoring  lets  us  infer  a  coindexing  relation  x  =  y  between  a 
marker  x  and  a  marker  y  whenever  every  other  marker  z  accessible  from  x  has  been  found 
to  be  distinct  from  x: 

PA  (Parameter  Anchoring)  : 
y  -<  X 


X  =  y 


every, 


X  ^ 


Disambiguation  By  Deduction 

In  this  section  I  will  show  how  each  disambiguated  interpretation  of  sentence  (1)  can 
be  derived  from  the  scope  forest  representation,  given  the  appropriate  context.  The  follow¬ 
ing  theorem  (a  simple  corollary  of  a  lemma  presented  in  section  5.1)  will  be  used  in  the 
derivations: 

Theorem  5.1  Given  any  two  markers  x  and  z,  t/ATOM(x)  is  true  and  x  =  z  is  true,  then 
atom(x)  is  also  true. 

Let  us  consider  again  sentence  (1),  repeated  here  for  convenience. 

(1)  Every  male  student  dates  an  undergrad. 

Let  us  now  suppose  that  sentence  (1)  is  followed  by  sentence  (49). 

(49)  I  met  her. 

In  Fig.  1 1  show  how  one  can  deduce  a  wide  scope  reading  for  an  undergrad  in  (1)  using  the 
fact  that  it  is  coindexed  with  her  in  (49).  The  first  five  lines  of  the  derivation  are  premises, 
the  result  of  the  translation  of  sentences  (1)  and  (49).  Line  6  is  obtained  from  line  3  using 
the  definition  of  woman*.  (Either  z  is  an  atom  or  a  group;  either  way,  line  6  is  valid.)  Line 
7  is  derived  from  line  1  using  the  same  reasoning  to  conclude  MALE-student*(x)  and  then 
using  world  knowledge.  At  this  point  we  can  apply  ASTR  to  derive  line  8  from  lines  4,6  and 
7.  Since  y  is  the  only  marker  left  in  the  anchoring  set  of  z,  we  can  apply  PA  to  derive  line  9. 
Line  10  can  be  derived  using  the  fa^t  that  (49)  describes  a  stage  level  sentence  (and  some 
facts  about  singular  anaphora  to  quantified  NP's  presented  in  [Poesio  and  Zucchi,  1991]). 
At  this  point,  we  can  use  theorem  5.1  to  conclude  that  the  projection  of  an  undergrad  is 
also  an  atom,  and  therefore  we  can  use  RAOQ  to  derive  line  12,  from  which  we  can  derive 
line  13  using  SFE,  if  desired. 
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1.  [<every  1  [x  nale- 

-8tttdent(x)]>  datas  (a  2  [y  undargrad(y)] )] ,  0 
(translation  of  ‘Every  Bala  student  datas  an  undergrad’) 

2.  X  <>  y 

(translation  of  'Every  Bala  student  datas  an  undergrad’, 
possibly  incorporating  sobs  fora  of  indefiniteness  effect) 

3.  [apaakar  aat  (bar 

3  [z  uoBan(z)])],  0 
(translation  of  ‘I  Bet  bar’) 

4.  X  <  z 

B.  y  <  z 

6.  «OBaiia(z) 

(definition  of  BOBan*) 

7.  'voBana(z) 

(world  knouladge) 

8.  z  <>  X 

(4,  6,  7,  ASTR) 

9.  z  =  y 

(S,  2.  4.  8.  PA) 

lO.atoffl(z) 

(uorld  knouladge) 

ll.atoa(y) 

(9,  10,  Tbaorea  6.1) 

12. t<«v*ry  1  Lx  Ml«-stud«nt(x)]>  dat*«  (a  2  [y  imdargrad(y)] )] ,  {2  <  1> 

(1,  11,  RAOq) 

13.  Cy  undargradCy)]  -a->  [[x  Bala-studantCx)]  “aTary->  [x  dataa  y]] 

(12,  SFE) 

Figure  1:  Wide  scope  for  an  xmdergrad 

Next,  an  example  of  disambiguation  in  which  -every  male  student  takes  wide  scope. 
Suppose  that  (1)  is  followed  by  the  following  sentence: 

(50)  They  meet  them  at  parties. 

The  derivation  of  a  wide  scope  reading  for  every  male  student  is  shown  in  Fig.  2. 
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1.  and  2. 

as  before. 

3.  [<they  3  [z  group(z)]>  neet-at-parties  <they  4  [■  group(B)]>],  0 

(translation  of  ‘They  sieet  them 

at  parties . ’ ) 

4.  and  5.  : 

as  before 

6.  Z  <>  B 

(translation  of  ‘They  aeat  them 

at  parties . ’ ) 

7.  V  -  X  or  u  ~  j 

(axiom  6.1) 

8.  z=xorz=y 

(axiom  6.1) 

9.  y*Bory*z 

(6,  7.  8) 

lO.group(z) 

(part  of  the  meaning  of  ‘they’) 

ll.gTonp(B) 

(part  of  the  meaning  of  ‘they’ 

12.group(y) 

(9,  10.  11) 

13. [<ev«ry  1  [x  male-8tud«nt(x)]>  dates  (a  2  [y  nndergrad(y)] )]  ,  <1  <  2> 

(1.  12.  qORG) 

14.  [x  Bale-student(x)]  -every->  tCy  uadergrad(y)]  -a->  [x  dates  y]] 

(13,  SFE) 

Figure  2;  Narrow  scope  for  an  undergrad 
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6  A  Formal  Presentation  of  DRTi 

6.1  The  Syntax 

The  main  syntactic  differences  between  DRTo  and  DRTi  axe  the  following: 

1.  Universally  quantified  sentences  and  conditionals  axe  now  represented  as  two  different 
classes  of  expressions:  the  class  of  quantifier  DRSs,  and  the  class  of  connective 
DRSs. 

2.  A  new  class  of  complex  conditions  is  introduced,  the  class  of  referential  DRSs,  to 
represent  indefinites,  definites  and  pronouns.  These  kinds  of  NP’s  axe  therefore  syn¬ 
tactically  separated  from  proper  names. 

3.  Another  new  kind  of  complex  condition  is  introduced,  the  scope  forest. 

4.  Operators  like  quantifiers  and  negation  are  given  indices. 

The  set  of  symbols  of  DRTi  includes,  in  addition  to  the  set  of  symbols  of  DRTo,  a  set  of 
indices  0 . . .  i . . .;  a  set  QDet  =  {every,  most  }  of  quantifier  operators;  and  a  set 
RDet  =  {a,  the,  he,  it,  she,  they}  of  referential  operators.  The  set  of  expressions  of 
DRTi  consists  of  marker  introducers  (defined  as  before),  conditions,  and  DRSs.  In  addition 
to  the  unary,  binary  and  coindexing  conditions  of  DRTo,  DRTi  includes  the  following  types 
of  conditions: 

1.  disjointness  conditions  of  the  form  x  ^  y,  where  x  and  y  are  markers. 

2.  accessibility  conditions  of  the  form  x  <y,  where  x  emd  y  are  msu'kers. 

3.  the  structural  conditions  atom(x)  and  GROUP(z),  where  x  is  a  marker. 

4.  negated  DRSs,  that  are  expressions  of  form  -‘iK,  where  A"  is  a  DRS  and  i  an  index; 

5.  connective  DRSs,  like  K\  (used  to  represent  the  conditional),  where  K\ 

and  K2  are  DRSs; 

6.  quantified  DRSs,  which  are  expressions  of  the  form  K\  ^  K2,  K\  is  of  the  form 

a* 

P(j) 

d  6  QDet,  and  A'j  is  a  DRS;  Ki  will  be  called  restriction,  K2  scope,  and  x  the 
main  marker; 

7.  referential  DRSs,  again  of  the  form  A^i  ^  A'2,  where  d  €  RDet,  and  K\  and  A'2 
are  DRSs;  again,  A^j  will  be  called  restriction  and  K2  will  be  called  scope; 

8.  scope  forests,  which  are  expressions  of  one  of  the  following  types: 
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<  di 


Q 

P(l) 


>  R  <  d'; 


a» 

Q(y) 


oc 


where  d  and  d'  are  in  QDet  U  RDet;  R  is  either  a  relation  or  a  negated  relation  of 
the  form  -i,  r'  where  r'  is  a  relation;  and  OC  is  a  set 

{i  <  j,. .  .k  ^  1}  of  ordering  constraints  among  the  indices  of  the  operators  in  the 
scope  forest. 

A  formula  of  DRTi  is  a  DRS  with  no  free  markers,  and  in  which  no  two  operators  are 
given  the  same  index.  The  donkey  sentence  Every  farmer  who  owns  a  donkey  beats  it  is 
represented  as  in  (51)  (which  the  reader  should  compare  to  the  DRTq  representation  (35), 
section  4.2). 


(51) 


a' 

FARMER(l) 


1 

0WNS(x,y) 

DONKEY(y) 

every, 


z  =  y 

it. 

BEATS(l,r) 

6.2  The  Semantics 


A  model  M  for  DRTi  is  a  pair  (JJ,  F):  is  an  interpretation  function,  and  is  a  Link- 
type  boolean  algebra  <  i?,A,<,U  >  in  which  <  models  the  inclusion  relation,  A  is  the  set 
of  atoms  of  D,  i.e.,  those  elements  of  D  such  that  no  other  element  is  included  in  them,  and 
U  models  Link’s  sum  (that  is,  x  U  y  is  the  minimal  element  of  U  which  includes  both  x  and 

y  )■ 

The  denotation  of  a  DRTj  expression  with  respect  to  M  is  defined  as  foUows.  The 
denotation  of  the  basic  conditions  and  of  the  marker  introducers  is  the  same  as  in  DRTq. 
The  new  atomic  conditions  have  the  following  denotations: 

1- Ik  =  />!/(*)  i^/(3/)}  • 

2.  ||ATOM(x)r  =  {a,/)|/(x)€A} 

3.  ||GROUP(x)r  =  {(f,  f)\  fix)  eU-A) 

4.  ||x,  -<  XjW^  =  0  if  t  >  j,  {(f,  f)\  Xijtj  €  dom(/)}  otherwise. 

The  denotation  of  the  ‘basic’  DRS  changes,  since  accessibility  is  now  controlled  by  discourse 
structure  and  the  type  of  NP.  Rather  than  pairs  (f,  /),  a  basic  DRS  wiU  denote  pairs  {f,  g) 
where  g  contains  additional  values  for  the  markers  introduced  by  the  DRS. 
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ir  =  {(f,  <7)1  fCfiC...CfnCgiC...C  C  g, 

(f,  /i)  €  ||o">|l^  and  ...(fn.u  In)  e  ||a*"||A^  and 
(^n,<7i)€|lCill^...  9)llC„.r} 

The  semantics  of  quantifier  and  referential  DRSs  will  be  a  straightforward  extension  of  the 
semantics  of  the  conditional  DRS  in  DRTq.  Let  us  first  introduce  the  following  definitions. 
Let  Ki  ^  A'2  be  a  quantifier  or  referential  DRS,  with  determiner  d,  index  t,  and  main 
marker  x.  Let  /  be  an  embedding  such  that  x  ^  dom(/).  The  set  of  DBS-Satisfying 
Embeddings,  DSE,  is  defined  as  follows: 

DSE(/,A')  =  {h|  {f,h)e||^rr} 

and  extended  as  follows  to  sets  of  embeddings: 

dse({/i,...,/„},A’)  =  {hi  {fi,h)  €  llA'II^  for  some  i  e  l...n} 

I  need  now  to  make  the  notion  of  ‘distancing’  introduced  in  section  5.1  more  precise.  As 
said  there,  distancing  has  the  effect  of  ‘changing  the  perspective’  on  a  certain  situation, 
that  is,  making  available  ‘outside’  the  DRS  only  the  sets  of  participants  to  a  situation  ‘as  a 
whole’,  so  that  whereas  before  the  closure  the  markers  are  available  for  singular  anaphora, 
after  they  are  only  available  for  plural  anaphora. 

Definition  6,1  Let  E  be  a  set  of  embedding  pairs  {i/,g}}.  DISTANCING(£?)  is  the  set  of 
embedding  pairs  E*  =  {{f,  h)}  defined  as  follom.  Let  i,-  ...Xj  be  the  markers  over  which  g 
extends  /.  Then  {f,  h)  will  be  in  DlSTANClNG(f?),  where  h  is  the  embedding  which  is  like  g 
for  the  markers  up  to  ii_i,  and  then  for  I  =  i..  .j,  h{xi)  =  hi(xi)  U  . . .  hk{xi) . . .,  for  any 
hk  s.t.  if,  hk)  €  E.^^ 

We  can  now  specify  the  denotations  of  quantified  and  referential  DRS’s: 

1.  IIA"!  A'211^  =  distancing({  (J,  g),  where  /  is  an  embedding  such  that  for  every 
h  €  dse(/,A’i)  there  is  an  h'  s.t.  h'  €  DSE(h,A'2)>  and  g  €  DSE(dse(/,  A'l),  A^2)}) 

2.  (|A'i  *"£$*•  ||A/  _  distancing({  (f,g)\  where  /  is  an  embedding  such  that  for  most 

h  €  dse(/,  A'l)  there  is  an  h'  s.t.  h'  €  DSE(h,A’2),  and  g  €  dse(dse(/,  A'l),  A'2)})^^ 

3.  IIA^  A'211^  =  {  (J,g)\  dse(dse(/,A',),A'2)  0  and  e  dse(dse(/, A^), A'2)} 

’^This  semantics  is  correct  for  the  examples  in  the  previous  section,  but  also  predicts  that  in  texts  like 
Every  man  who  owns  a  donkey  beats  it.  The  donkeys  hate  their  owner.,  The  donkeys  would  refer  to  the  set 
of  all  donkeys  beaten  by  any  man.  To  fix  this  problem  it  would  be  necessary  to  modify  the  defintion  of 
distancing  to  allow  for  ambiguities. 

’^This  definition  suffers  from  the  so  called  proportion  problem  [Heim,  1982;  Rooth,  1987].  Let  us  consider  a 
model  with  100  farmers,  99  of  which  own  a  donkey  and  don’t  thrive  whereas  one,  Pedro,  owns  1000  donkeys 

and  thrives.  The  definition  of  the  semantics  of  Ki  Kj  above  would  predict  that  this  model  would 

make  Most  farmers  who  own  a  donkey  thrive  true,  which  is  rather  counterintuitive.  The  reason  is  that  if  the 
restriction  is  a  DRS  K  like 
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4.  ||A'i  A'2||'^  =  {(f,  fi)l  fix)  =  fiy)  for  some  marker  y  and  (f,  g)  €  11A"2||^} 

(Analogous  rules  could  be  developed  for  A'l  A’2  and  Ki  A'2,  of  course.)  Next,  the 
denotations  of  the  negated  DRS  and  the  conditional,  almost  identical  to  those  in  DRTq: 

1.  ll-iiA'II^  =  {(/■,  f)\  there  is  no  g  such  that  if,g)  €  ||A'||^} 

2.  II  A’l  A'211^  =  {(f>/)|  for  every  h  s.t.  (f,  h)  €  l|A'’i||^  there  is  an  /  s.t. 

(h,  1)  €  ||A''2||^^,  and  g  is  one  such  /} 

(Note  that,  because  of  these  definitions  and  of  the  definition  of  x  -<  y,  negation  and  condi¬ 
tionals  still  ‘block’  anaphoric  reference  in  the  same  way  that  they  do  in  DRT.)  Finally,  in 
order  to  define  the  semantics  of  scope  forests,  we  have  to  specify  more  precisely  the  set  of 
paths  associated  to  them.  This  definition  will  be  given  in  terms  of  an  extraction  procedure. 

Definition  6.2  An  extraction  procedure  is  an  algorithm  which  ‘pulls  out’  all  the  oper¬ 
ators  from  a  scope  forest  (p[i\ . .  one  after  the  other,  and  returns  a  disambiguated 

DRS  (a  DRS  that  is  not  a  scope  forest),  under  the  constraint  that  if  the  ordering  constraint 
i  <  j  is  in  OC,  then  the  operator  with  index  i  is  extracted  before  the  operator  with  index  j. 

What  the  ‘extraction  rules’  do  is  fairly  obvious  and  can  be  compared  to  what  Quantifier 
Raising  does.  I  will  just  give  two  examples: 


negation: 


Ti  R  T2 


TIKT2 


a 

farmer(x) 


DONKEY(y) 


OWNS(x,y) 


DSE(f,A')  will  include  a  different  embedding  for  each  pair  of  values  for  the  markers  x  and  y,  and  the  fact 
that  Pedro  owns  1000  donkeys  will  make  him  a  participant  in  1000  pairs.  The  source  of  the  problem,  in 
short,  is  that  the  pairs  are  counted,  rather  than  the  number  of  possible  distinct  values  that  may  be  given 
to  X.  Booth  presented  a  solution  to  the  problem  which  could  easily  be  accomodated  in  my  framework.  The 
idea  is  to  partition  a  set  of  embeddings  into  equivalence  classes  according  to  the  value  that  the  embeddings 
associate  to  x; 


EP(£,x),  where  £  is  a  set  of  embeddings,  and  x  is  a  marker,  =  {  £a|  each  h  €  £«  is  a  member 
of  £,  and  for  all  h  €  £a  h(x)  =  a  for  some  a  € 

and  then  replace  the  definition  of  K\  above  with  the  one  below; 

•  ||A'i  A'2  11*^  =  {  (f.  y)l  fo*  most  £  €  EP(DSE(/,  A'i),x)  there  is  an  h  €  £  s.t.  h'  €  DSE(h,  £'2) 

for  some  h'  . . . }  (y  is  defined  as  before) 

This  definition  does  not  suffer  from  the  problem  above,  but  it  does  not  explain  why  the  semantics  of  most 
should  be  defined  in  terms  of  equivalence  classes,  while  the  semantics  of  every  should  be  based  on  pairs. 
A  more  illuminating  solution  would  be  to  assume  an  ambiguity,  originated  by  the  possibility  of  choosing 
different  farmers-owning-donkeys  ‘cases’  in  the  restriction  of  a  quantifier:  developing  this  solution  in  detail, 
however,  would  go  outside  the  scope  of  this  paper. 
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indefinites: 


a* 

f 

Ti  R  (  a. 

P(x) 

) 

P(x) 

1 

Ti  K  X 

The  first  extraction  rule  extracts  a  negation  operator  from  a  scope  forest,  and  produces  as  a 
result  a  negated  DRS  with  the  scope  forest  from  which  the  operator  has  been  extracted  in  the 
scope  of  the  negation  operator.  Ti  and  T2  can  be  either  unextracted  terms  or  markers.  The 
second  rtile  extracts  an  indefinite  term  from  a  scope  forest,  and  produces  a  referential  DRS 
in  whose  scope  there  is  a  new  scope  forest  in  which  the  marker  x  has  replaced  the  indefinite 
term.  It  is  worth  observing  that  nothing  prevents  having  more  than  one  extraction  rule  per 
determiner,  or  extraction  rules  which  operate  on  the  scope  forest  as  a  whole  (this  will  take 
care  of  the  additional  readings  of  examples  like  Two  examiners  marked  six  scripts).  An 
‘empty  extraction  rule’  wiU  replace  a  scope  forest  without  operators  left  with  a  basic  DRS. 

Definition  6.3  Let  K  be  the  scope  forest  (p[ii . . .  in]OC  with  indices  ij . . .  i„  and  ordering 
constraints  OC.  The  paths  associated  to  SF  are  all  the  disambiguated  DRS’s  K'  which 
can  be  derived  from  K  by  an  extraction  procedure.  . 

We  can  now  define  the  denotation  of  a  scope  forest  as  follows: 

1.  II  K  :  (p[ii .  ,.in]OC  ||^  =  (J  HPH^,  for  every  path  P  associated  to  K 

6.3  The  Inference  Rules 

The  set  of  inference  rules  of  DRTi  includes,  in  addition  to  the  traditional  rules  (Modus 
Ponens,  Universal  Instantiation,  etc.)  all  the  rules  defined  in  section  5.1  and  5.4.  Because 
of  the  change  in  the  semantics  of  the  basic  DRS,  we  need  to  modify  the  definition  of  logical 
entailment  given  in  section  4.2  as  follows:  if  K\  and  K2  are  formulas,  Ki  |=  K2  iff  for  all 
models  M  in  which  A^j  is  true,  for  every  pair  (f,g)  €  ||A'i||^,  there  is  an  embedding  h, 
g  Q  h,  such  that  (f,  h)  €  ||A'2||^.  With  this  new  definition,  the  following  theorem  holds: 

Theorem  6.1  MP,  UI,  ROR,  RAOQ,  QORG,  SFE,  WUI,  PA  and  ASTR  are  sound  DRTi 
inference  rules. 

Proof:  I  will  show  how  one  can  prove  this  for  MP;  the  proof  for  the  other  inference  rules 
is  analogous.  Let  (f,  g)  be  a  pair  in  the  denotation  of  a  formula  K  containing  the  two 

conditions  P  and  K\  where  K\  consists  of  the  condition  P  and  K2  consists  of 

the  condition  Q.  Without  loss  of  generality,  we  can  assume  that  the  two  conditions  are  the 
‘last’  conditions  in  K.  Let  /n  be  an  extension  of  /  such  that  {/■„,  /„)  G  ||A'i  **'*.^"*  /fjH  and 
if  nig)  €  ||P||.  Because  of  the  definition  of  the  conditional  DRS,  there  must  be  an  fi  such 
that  ig,  h)  G  ||Q||.  But  then,  (f,  h)  will  be  in  the  denotation  of  the  formula  K'  obtained  by 
adding  Q  at  the  end  of  A',  QED. 


34 


7  Discussion 


It  is  useful  at  this  point  to  discuss  the  implications  of  the  scope  forest  idea,  and  to 
state  more  clearly  the  position  I  am  assuming  with  respect  to  certain  topics  in  ambiguity 
processing  which  have  been  the  object  of  much  discussion. 

7.1  Preferences 

I  have  observed  in  section  2  that  Kurtzman  and  MacDonald,  and  before  them  loup, 
answered  in  the  affirmative  to  the  question:  Do  people  actually  have  preferences?  The 
existence  of  preferences  brings  out  other  questions,  however:  (i)  Do  people  apply  these 
principles/preferences  all  the  time?  (ii)  Which  principles/preferences  are  actually  used? 
(iii)  Is  it  possible  to  derive  them  from  some  more  general  consideration?  (iv)  How  can  these 
preference  rules  be  formalized? 

I  argued  in  section  2  that  the  ‘scope  forest  theory’  is  compatible  both  with  the  ‘lazy’ 
model  of  disambiguation  proposed  in  section  1  and  with  a  ‘fanatical’  model  of  disambigua¬ 
tion  in  which  the  system  always  tries  to  come  out  with  an  interpretation.  An  intriguing 
hypothesis  about  question  (i)  is  that  whether  people  try  to  disambiguate  or  not  depends 
on  (a)  how  high  the  cost  of  disambiguating  is  and  (b)  how  important  it  is  for  them  to 
disambiguate.  Kurtzman  and  MacDonald  only  studied  sentences  with  two  quantified  NP’s, 
and  the  determiners  that  appeared  in  those  sentences  were  every,  each,  the,  the  same,  a 
and  a  different.  These  quantifiers  are  ‘easy  to  compute’  -  they  can  in  fact  be  computed  by 
DFAs  [van  Benthem,  1987].  It  would  be  interesting  to  repeat  the  experiment  with  determin¬ 
ers  which  are  ‘harder’  than  the  determiners  above,  such  as  most  and  fevr.  for  example,  to  try 
sentences  like  Most  male  students  date  few  undergrads.  Another  source  of  complexity  is  the 
number  of  possible  interpretations:  again,  the  hypothesis  would  predict  that  as  the  number 
of  interpretations  gets  larger,  people  would  disambiguate  less.  This  second  prediction  could 
be  tested  by  running  experiments  on  sentences  containing  numerical  quantifiers,  such  as 
Two  examiners  marked  six  transcripts.. 

It  is  also  possible  that  people  do  not  disambiguate  in  certain  cases  because  they  have 
no  reason  to  do  so.  Hirst  argues  convincingly  in  [Hirst,  1990]  for  a  similar  strategy  for 
systems  processing;  but  there  also  seem  to  be  linguistic  reasons  to  believe  this.  Consider 
the  following  sentence: 

(52)  For  simplicity  of  exposition,  I  will  assume  in  most  of  the  discussion 

that  variable  names  are  magically  correct  from  the  start.  [Hirst,  1987] 

The  author  is  describing  a  program  called  Absity.  Depending  on  how  the  start  is  interpreted 
(the  start  of  the  program,  or  the  moment  at  which  a  variable  is  created),  one  can  interpret  it 
to  take  scope  over  variable  names,  or  viceversa.  Both  interpretations  are  equally  acceptable: 
deciding  the  scope  of  the  start  is  not  required  for  the  understanding  of  (52). 

Further  evidence  for  the  claim  that  we  humans  disambiguate  only  when  we  have  some 
reason  to  do  so  can  be  found  by  observing  how  people  refer:  for  example,  how  they  use 
the  demonstrative  that.  Asher  discusses  the  ‘sloppiness’  exhibited  by  writers  when  using 
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that  to  refer  to  actions  or  plans  in  [Asher,  1990].  We  have  found  that  something  similar 
also  occurs  when  that  is  used  to  refer  to  objects.  For  example,  in  (53),  that  -  if  interpreted 
literally  -  could  refer  either  to  the  engine,  or  to  the  boxcar,  or  to  both: 

/eo\  We’re  gonna  hook  up  engine  £2  to  the  boxcar  at  Elmira,  and  send  that  off  to 
Corning. 

disambiguating  that  reference  is  not,  however,  necessary  to  understand  (53)  -  any  disam¬ 
biguation  will  probably  lead  to  the  same  action  on  the  part  of  the  system  which  receives 
this  order,  since  the  boxcar  and  the  engine  are  ‘hooked  up’^^ 

Questions  (ii)  and  (iii)  -  What  are  these  principles?  Is  it  possible  to  derive  them  from 
some  more  general  principle?  -  are  also  interesting  because  answering  them  might  help  in 
finding  some  logic  behind  the  disparate  set  of  Preference  Heuristics  that  have  been  proposed 
for  NLP  systems  [VanLehn,  1978;  Grosz  et  al.^  1987;  Hurum,  1988a;  Moran,  1988].  Moran 
lists  in  [Moran,  1988]  eight  preference  heuristics.  Of  these,  two  can  be  related  to  the 
Quantifier  Hierarchy  Principle  (3.1  -  wide  scope  preference  for  each  -  and  3.2  -  wide  scope 
preference  for  WH-phrases  but  not  over  each),  three  establish  a  similar  hierarchy  between 
quantifiers,  negation  and  modals  (5.1  -  strong  preference  for  some  to  outscope  negation;  5.2 
-  preference  for  negation  to  outscope  every,  and  5.3  -  preference  for  any  to  outscope  modals); 
and  two  are  the  Scope  Constraint  (presented  in  section  3)  and  May's  generalization  that 
quantifiers  cannot  be  raised  out  of  a  relative  clause.  (Hurum  [1988b]  presents  additionad 
heuristics.) 

I  am  afraid  that  an  answer  to  questions  (i)~(iii)  won't  be  had  for  some  time;  answering 
question  (iv),  however,  is  a  preliminary  step,  and  can  give  some  interesting  insights  on  the 
more  complicated  problems.  Giving  a  semantics  to  unscoped  logical  forms  does  not  rule  out 
scope  forest  reduction  rules  based  on  preferences.  These  preferences  should  be  formalized  as 
defeasible  rules,  however,  with  consequent  problems  in  formalization.  The  hardest  problems 
are  how  to  cumulate  evidence  and  how  to  choose  among  competing  hypothesis.  The  ab¬ 
duction  scheme  of  Hobbs  and  Stickel  [Hobbs  et  al.,  1988]  is  an  obvious  possibility;  another 
possibility  is  to  use  one  of  the  probabilistic  logics  recently  proposed.  Once  this  extended 
logic  is  developed,  one  could  attempt  to  characterize  the  cases  of  perceived  ambiguity  as 
follows:  one  recognizes  an  ambiguity  whenever  the  preference  rules  are  used  (that  is,  there 
are  reasons  to  disambiguate)  and  more  than  one  plausible,  but  competing  hypothesis  are 
produced.  One  does  not  perceive  the  ambiguity  if  either  the  preference  rules  are  not  used, 
or  they  do  not  produce  plausible  disambiguation  hypotheses. 

7.2  The  Role  of  Syntactic  Constraints 

One  should  not  read  in  what  I  have  written  so  far  a  claim  that  syntactic  constraints  play 
no  role  in  determining  scope  relations.  There  is  evidence  to  the  contrary:  May  [1985],  for 
example,  shows  that  sentences  like  Some  student  admires  every  professor  may  be  ambiguous 
in  isolation,  but  in  a  VP-deletion  context,  like  (54),  the  ambiguity  evaporates. 

(54)  Some  student  admires  every  professor,  but  John  doesn’t. 

'*This  sentence  has  been  excerpt  from  the  transcript  of  a  conversation  recorded  at  the  University  of 
Rochester  for  the  TRAINS  project. 
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It  has  also  been  brought  to  my  attention  by  Anil  Belvadi  (p.c.)  that  in  the  Kanada  language 
several  cases  of  scope  ambiguities  are  lexically  resolved,  that  is,  in  sentences  like  Every  male 
student  dates  an  undergrad  one  would  use  a  different  determiner  when  an  undergrad  takes 
wide  and  narrow  scope. 

That  syntactic  factors  can  be  used  to  disambiguate  is  not  a  problem  for  the  theory  I 
have  been  proposing.  All  that  I  am  proposing  is  a  method  for  resolving  certain  ambiguities 
using  semantic  and  pragmatic  knowledge;  if  the  ambiguity  can  be  resolved  syntactically,  the 
translation  procedure  will  not  produce  a  scope  forest,  but  the  appropriate  disambiguated 
DRS.  My  claim  is  much  weaker  than  the  strong  claim  of  Gawron  and  Peters  in  [Gawron 
and  Peters,  1990](cfr.  pages  53-56),  that  the  determination  of  the  scope  of  NP’s  is  entirely 
determined  by  contextual  factors  (Gawron  and  Peters  call  them  circumstances).  In  order 
to  support  this  stronger  claim  one  should  give  some  sort  of  explanation  for  data  like  (54). 

7.3  How  Many  Interpretations  are  Computed? 

Another  question  left  open  by  Kurtzman  and  MacDonald’s  experiments  is  whether  peo¬ 
ple,  when  they  do  disambiguate,  do  indeed  compute  only  one  interpretation,  or  perhaps  a 
couple  of  them  -  as  predicted  by  the  theory  presented  in  this  paper,  and  as  one  would  expect 
on  the  basis  of  ‘common  sense’  arguments  like  those  presented  in  section  1  -  or  compute  all 
interpretations  in  parallel,  as  predicted  for  example  by  the  disjunction  theory.  Kurtzman 
and  MacDonald’s  data  suggest  that  people  compute  both  interpretations  of  sentences  like 
Every  boy  climbed  a  tree  -  they  claim  that  people  always  initially  compute  both  interpre¬ 
tations  and  then  usually  quickly  adopt  one  interpretation  and  delete  the  other  [Kurtzman, 
p.c.].  Once  more,  however,  running  tests  with  sentences  with  a  large  number  of  interpreta¬ 
tions  would  be  the  only  way  of  discriminating  among  the  competing  hypotheses.  The  sheer 
number  of  interpretations  should  make  it  possible  to  verify  whether  indeed  people  compute 
them  all  and  then  ‘filter’,  or  rather  compute  a  few  (possibly  using  preference  heuristics). 

7.4  Other  Types  of  Ambiguity? 

One  may  also  wonder  if  this  kind  of  semantics  can  be  used  to  handle  different  kinds  of 
ambiguities.  The  reasoning  goes  as  follows.  When  given  a  sentence  with,  say,  a  lexical  am¬ 
biguity,  the  translation  procedure  will  produce  more  than  one  logical  form,  or,  alternatively, 
a  logical  form  containing  some  construct  for  representing  the  ambiguity  ‘in  loco’,  like  the 
Polaroid  words  proposed  by  Hirst  [Hirst,  1987].  The  second  kind  of  representation  would 
be,  of  course,  more  in  keeping  with  the  scope  forest  proposal,  but  in  both  cases  we  should 
be  able  to  assign  as  a  denotation  to  the  logical  form  a  set  of  pairs  (if  more  than  one  logical 
form  is  produced  by  the  translation,  this  set  would  be  the  union  of  the  sets  denoted  by  each 
of  these  logical  forms).  It  would  be  necessary  to  define  disambiguation  rules  and  maybe 
preference  rules  which  do  the  job  of  spreading  activation  in  Hirst’s  system,  of  course,  but  at 
least  in  principle,  this  approach  seems  practicable.  Other  kinds  of  semantic/pragmatic  dis¬ 
course  disambiguation  processes  -  for  example,  discourse  structure  disambiguation  -  seem 
amenable  to  this  kind  of  treatment  as  well.  What  this  means  is  that  we  would  be  able  to 
assign  a  denotation  to  a  disambiguated  parse  tree,  and  therefore  all  of  the  semantic  and 
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pragmatic  processing  of  the  system  could  be  done  ‘deductively’:  the  system  could  just  be 
fed  a  parse  tree. 

If  we  were  able  to  do  this,  we  would  have  a  principled  way  to  formalize  the  idea  that 
discourse  disambiguation  does  not  proceed  sequentially  -  lexical  disambiguation  first,  then 
scope  disambiguation,  then  tense  interpretation,  then  reference  disambiguation,  and  so  forth 
-  but  instead  as  a  form  of  constraint  propagation  process,  in  the  sense  that  one  starts  with  a 
set  of  constraints  which  are  compatible  with  a  large  number  of  possible  interpretations,  and 
then  proceeds  to  reduce  the  number  of  allowed  interpretations  by  inferring  new  constraints 
from  the  existing  ones.  The  idea  that  discourse  interpretation  processes  are  interrelated 
in  this  way  has  been  made  popular  by,  among  others,  J.  Hobbs  [Hobbs,  1979;  Hobbs 
et  al.,  1988],  and  methods  for  implementing  this  kind  of  disambiguation  process  have  been 
proposed,  besides  Hobbs,  by  K.  Dahlgren  [Dahlgren,  1988],  by  Charniak  and  Goldman 
[Charniak  and  Goldman,  1988;  Goldman,  1990],  and  by  the  University  of  Rochester  group 
working  on  the  Discourse  System  project  [Allen  et  ai,  1989].  On  the  psycholinguistic 
side,  a  model  of  processing  in  which  different  kinds  of  discourse  information  are  related 
has  been  proposed  by  Crain  and  Steedman  [Crain  and  Steedman,  1985].  None  of  these 
projects  was  focused  on  the  problem  of  ambiguity,  however,  so  the  criticism  to  ‘traditional’ 
systems  applies  to  each  of  them  -  in  all  of  them,  the  only  way  to  give  a  semantics  to  an 
ambiguous  statement  is  to  computing  all  the  interpretations  and  then  either  choose  one  or 
use  a  disjunctive  statement. 

One  might  even  carry  the  hypothesis  further,  and  claim  that  even  structural  ambiguities 
like  prepositional  attachment  could  be  handled  in  this  way.  That  is,  one  could  feed  the 
semantic  interpreter  a  ‘compact’  representation  for  an  ambiguous  parse  tree  of  the  kind 
proposed  by  Seo  and  Simmons  [Seo  and  Simmons,  1989],  instead  of  a  disambiguated  parse 
tree.  The  denotation  of  this  object  would  be,  once  more,  the  union  of  the  denotations  of  the 
unambiguous  parse  trees.  Serious  work  on  the  idea  is  clearly  needed  before  its  usefulness 
can  be  judged. 
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8  Conclusions  and  Future  Work 


The  main  aim  of  this  paper  was  to  show  that  a  natural  language  processing  system  can 
perform  inferences  on  the  basis  of  what  it  has  been  told  without  doing  scope  disambiguation. 
A  simple  parser- with- reasoner  system  built  according  these  ideas  has  been  implemented, 
and  the  implementation  of  a  second  version  is  under  way.  This  new  version  will  be  used  as 
the  reference  and  scope  disambiguation  module  of  the  discourse  system  TRAINS  [Allen  and 
Schubert,  1991].  My  main  goals  for  the  foreseeable  future  are  (i)  to  explore  the  properties  of 
DRTj  in  more  detail,  looking  for  counterintuitive  properties*®;  (ii)  to  extend  the  treatment 
to  cover  other  operators  which  create  scope  ambiguities,  most  notably  tense  operators, 
and  (iii)  to  develop  a  version  of  the  logic  in  which  I  can  formalize  preference  rules,  and 
compare  the  two  strategies  -  leaving  the  ambiguities  in  place,  vs.  generating  one  (or  few) 
interpretations  using  preferences. 

8.1  Reasoning  Without  Disambiguating? 

The  first  direction  of  research  is  to  discover  what  kind  of  inferences  one  can  do  with  a 
non-disambiguated  logic  -  that  is,  how  the  set  of  ‘weak’  inference  rules  proposed  in  section 
5  can  be  extended  to  a  complete  set  of  inference  rules,  and  how  this  set  will  look  like.  The 
ultimate  goal  is  to  develop  a  logic  in  which  one  never  disambiguates  -  that  is,  one  in  which 
only  scope  forests  are  used,  and  the  usual  ‘scoped’  forms  of  representation  are  only  used  as 
shorthands. 

A  crucial  question  is  whether  we  get  any  actual  improvement  in  performance  by  leaving 
the  ambiguities  in  place,  that  is,  whether  inferencing  with  DRTi  is  so  expensive  as  to  offset 
the  advantages  gained  from  not  having  to  produce  all  the  interpretations.  A  general  answer 
would  require  determining  the  complexity  of  reasoning  with  DRTj.  My  first  objective  will 
be  to  perform  an  experimental  study,  by  implementing  a  new  version  of  the  scoping  and 
reference  disambiguation  module  of  TRAINS  and  comparing  its  performance  both  with  the 
existing  module  and  with  a  module  which  works  according  to  the  proposals  of  Moran  and 
Hurum. 


8.2  Formalizing  Preferences 

To  formalize  the  kind  of  preferences  discussed  by  Kurtzman  and  MacDonald,  as  well  as 
to  represent  the  disambiguating  information  originated  from  lexical  and  world  knowledge, 
I  will  need  a  logic  in  which  it  is  possible  to  jump  to  conclusion  on  the  basis  of  uncertain 
knowledge,  and  to  choose  between  competing  possibilities. 

For  example,  it  is  a  lexical  property  of  the  predicate  comes  with  that  the  argument  in 
subject  position  takes  scope  over  the  argument  in  object  position,  so  that  Every  chess  set 
comes  with  a  spare  pawn  has  no  reading  in  which  a  spare  pawn  scopes  over  Every  chess  set. 

’^Because  of  the  use  of  partial  situations  and  the  formulation  of  the  definition  of  the  marker  introducers 
it  doesn’t  seem  to  suffer  from  some  of  the  unwanted  properties  of,  e.g.,  certain  dynamic  logics:  for  example, 
i4  -*  i4  is  a  logical  axiom. 
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In  order  to  express  such  ‘defeasible  disambiguation  axioms'  a  Disambiguation  Schema 
can  be  used,  whose  general  form  would  be 

S  — >  {i<j} 

where  S  is  the  scope  of  the  quantifier  with  the  strictest  scope  and  i  and  j  are  indices  of 
operators.  The  disambiguation  schema  for  the  predicate  comes  with,  for  example,  would  be 

[i  comes-with  j]  — >  {i  <  j}. 

This  schema  is  an  abbreviation  for  a  list  of  axiom  schemata  each  of  which  adds  the  ordering 
constraint  i  <  j  to  a  scope  forest.  Unfortunately,  schemata  of  this  kind  do  not  always  hold: 
they  can  only  be  used  if  the  logic  can  express  some  form  of  uncertainty. 

A  number  of  proposals  for  formalizing  the  process  of  choosing  among  preferences  have 
been  made,  among  which  belief  nets  [Pearl,  1988]  and  logics  in  which  it  is  possible  to  order 
the  propositions  in  the  knowledge  base  according  to  some  principle  [Nebel,  1990].  It’s  not 
yet  clear  whether  any  of  these  proposals  is  appropriate  for  my  purposes. 

8.3  Tense  and  Scope  Ambiguities 

The  parser  used  in  TRAINS  produces  a  logical  form  in  which  not  only  quantified  NP’s, 
but  also  certain  operators  are  unscoped,  among  which  negation  and  tense  operators^^.  A 
scoping  algorithm  is  then  applied,  and  then  the  logical  form  is  translated  into  Episodic 
Logic,  the  logic  developed  by  Len  Schubert  and  Chung-Hee  Hwang  [Schubert  and  Hwang, 
1990]  by  a  final  step  in  which  tense  operators  are  eliminated  and  replaced  by  formulas  which 
contain  episodic  variables  using  an  additional  contextual  structure,  Tense  Trees  [Schubert 
and  Hwang,  1990]. 

In  order  to  test  whether  the  scope  forest  idea  can  be  used  in  practice  to  represent  scope 
ambiguities  other  than  those  between  NP's,  I  plan  to  study  how  to  integrate  the  tense  tree 
interpretation  mechanism  with  the  other  scope  ambiguity  resolution  processes,  so  that  it 
won’t  be  necessary  to  decide  upon  a  scoping  before  applying  the  tense  interpretation  rules. 

8.4  Other  Issues 

A  question  that  needs  to  be  addressed  in  more  detail  is  the  relation  between  scope 
ambiguities  and,  for  example,  collective/distributive  ambiguities,  in  order  to  see  better  the 
relation  between  this  approach  and,  for  example,  ihe  approach  proposed  by  Bunt.  The 
reference  interpretation  model  also  needs  to  be  extended.  I’m  in  particular  interested  in 
studying  sentences  like  Every  chess  set  comes  with  a  spare  pawn.  IVs  taped  to  the  bottom., 
in  which  the  interface  between  reference  and  scope  disambiguation  seems  totally  different 
from  the  one  presented  in  the  examples  in  section  5.4. 


‘‘The  logical  form  is  essentially  that  proposed  in  [Schubert  and  Pelletier,  1982]. 
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